y 



2 

CGA GTA AAT CGT TCT CGA AGC CTT AGT AAT 
arg val asn arg ser arg ser leu ser asn 

62 

TCA CCA GAT GAT GAA GTT CGA TCA ATC ATC 
ser pro asp asp glu val arg ser ile ile 

122 

TGG GTT AAC ACT GGT GGT CCA AAA GCT GCC 
trp val asn thr gly gly pro lys ala ala 

182 

GAA TCA ACA CAG GCT ATG GAT CGA AGT TGT 
glu ser thr gin ala met asp arg ser cys 

242 

AGT TTC TTA CAA ACA TTA ACG GGA CGC TTA 
ser phe leu gin thr leu thr gly arg leu 

302 

GCT TTG CAG TGG GTT GTT TGC AGT GGC AGC 
ala leu gin trp val val cys ser gly ser 

362 

TTC TTT TTT GAA TTA ATG GTA AAG AGC ATG 
, phe phe phe glu leu met val lys ser met 

Cadherin 
422 I XXX cleavage xx| 

GAG GCT CCA AGG AAA AGT CGT TTT CCA GAA 
glu ala pro arg lys ser arg phe pro glu 

482 

AGC ACG ATT GCT AGT GAT ATA GTT TCA CGA 
ser thr ile ala ser asp ile val ser arg 

542 

CTC AAT ACA AGC CTT GCA TTC TTT CTC AAT 
leu asn thr ser leu ala phe phe leu asn 

602 

GTT TTT AGC CTT ATA AAG TCC TGC TAT AAA 
val phe ser leu ile lys ser cys tyr lys 

662 

AAT CCC AGT GTT CTG GTG TCC TTG AGG CTG 
asn pro ser val leu val ser leu arg leu 

722 

CAC TAT GTT ACA TTA AAC TTA CCC TGC AGC 
his tyr val thr leu asn leu pro cys ser 

782 

TCT GTT TCT TCT GCA ACA TCT CAG AGT TCT 
ser val ser ser ala thr ser gin ser ser 

842 

ATT GCA AAT ATG TTT GAA TTA TCC GTG CCT 
ile ala asn met phe glu leu ser val pro 



1 
A 

32 

AGC AAT CCA GAT ATA TCT GGG ACT CCC ACG 
ser asn pro asp ile ser gly thr pro thr 

92 

GGG AGT AAG GGT TTA GAT CGC TCC AAT TCC 
gly ser lys gly leu asp arg ser asn ser 

152 

CCA TGG GGA TCC AAC CCC AGT CCA AGT GCA 
pro trp gly ser asn pro ser pro ser ala 

212 

AAT CGT ATG TCT TCG CAC ACA GAG ACG TCA 
asn arg met ser ser his thr glu thr ser 

272 

CCA ACT AAA AAG CTT TTT CAC GAG GAG CTG 
pro thr lys lys leu phe his glu glu leu 

332 

GTT CGG GAA TCA GCT TTG CAA CAA GCC TGG 
val arg glu ser ala leu gin gin ala trp 

392 

GTG CAC CAT TTA TAC TTT AAT GAT AAA CTT 
val his his leu tyr phe asn asp lys leu 

452 

CGT TTC ATG GAT GAC ATT GCA GCT CTT GTC 
arg phe met asp asp ile ala ala leu val 

512 

TTT CAG AAG GAC ACA GAA ATG GTT GAG AGA 
phe gin lys asp thr glu met val glu arg 

572 

GAT CTG TTG TCT GTT ATG GAC AGA GGA TTT 
asp leu leu ser val met asp arg gly phe 

632 

CAG GTG TCT TCA AAG CTT TAC TCA TTA CCG 
gin val ser ser lys leu tyr ser leu pro 

692 

GAT TTT CTA CGA ATC ATC TGC AGT CAT GAG 
asp phe leu arg ile ile cys ser his glu 

752 

TTA CTT ACT CCA CCT GCA TCT CCA TCA CCT 
leu leu thr pro pro ala ser pro ser pro 

812 

GGA TTT TCT ACG AAT GTA CAA GAC CAA AAG 
gly phe ser thr asn val gin asp gin lys 

872 

TTC CGC CAA CAG CAT TAT TTG GCA GGA CTT 
phe arg gin gin his tyr leu ala gly leu 



-7 
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Cadherin 
I XX EC motif XX I 

GTG TTA ACA GAG CTG GCT GTC ATT TTA gac 
val leu thr glu leu ala val ile leu asp 

962 

AAG AAA GTC ATC AAT ATG GTA CAC AAT TTA 
lys lys val ile asn met val his asn leu 

1022 

TCT GAC CCT CAG ATA AAG GCT CGA GTG GCC 
ser asp pro gin ile lys ala arg val ala 

1082 

ATG GAA ACT GTA CCT CAG CTG TAT GAT TTT 
met glu thr val pro gin leu tyr asp phe 

1142 

ATT TGT ATA GCC ACT GAT GAT TAT GAA AGT 
ile cys ile ala thr asp asp tyr glu ser 

1202 

GCC ATG GCA ATC GCA GGG ACA TCG GTC CCT 
ala met ala ile ala gly thr ser val pro 

1262 

ACG TCA ACG AGT GGC AGG CAA CAC ACT ACC 
thr ser thr ser gly arg gin his thr thr 

1322 

ATC TGT CTA CTT TGG GTT CTC AAA AAT GCA 
ile cys leu leu trp val leu lys asn ala 

1382 

GAT CTC TCA GTC TTG CAG CTA AAC CGG CTA 
asp leu ser val leu gin leu asn arg leu 

1442 

TTT GAG TAT AAA GGG AAA AAA GTG TTT GAA 
phe glu tyr lys gly lys lys val phe glu 

1502 

AAA GAC ATG AGA GCA AAG CTT GAA GAA GCT 
lys asp met arg ala lys leu glu glu ala 

1562 

ATG GTA CGG CGA AGC CGA GGA CAG CTC GAG 
met val arg arg ser arg gly gin leu glu 

1622 

CAA GAA AAT TTG AGG TGG AGG AAA GAT ATG 
gin glu asn leu arg trp arg lys asp met 

1682 

GAC AAA TCA AGA GCA GAG ATT GAA CAC GAA 
asp lys ser arg ala glu ile glu his glu 

1742 

GCA AAC CTA ATC ATT TTA GAT ACA TTA GAG 
ala asn leu ile ile leu asp thr leu glu 

1802 

TCC AAA GAG AGC ATT CTT GGT GGA GTG CTA 
ser lys glu ser ile leu gly gly val leu 



932 

cct gat get gaa gga ctg TTT GGA TTG CAT 
pro asp ala glu gly leu phe gly leu his 

992 

CTC TCC AGT CAC GAC TCA GAC CCG CGG TAC 
leu ser ser his asp ser asp pro arg tyr 

1052 

ATG TTG TAT CTA CCT CTG ATT GGT ATT ATC 
met leu tyr leu pro leu ile gly ile ile 

1112 

ACA GAA ACT CAC AAT CAA CGA GGA AGA CCA 
thr glu thr his asn gin arg gly arg pro 

1172 

GAG AGC GGA AGT ATG ATA AGC CAG ACC GTT 
glu ser gly ser met ile ser gin thr val 

1232 

CAA CTA ACA AGG CCT GGC AGT TTC CTC CTC 
gin leu thr arg pro gly ser phe leu leu 

1292 

TTT TCA GCA GAA TCA AGT CGA AGC CTT TTG 
phe ser ala glu ser ser arg ser leu leu 

1352 

GAT GAA ACA GTT CTA CAG AAG TGG TTT ACA 
asp glu thr val leu gin lys trp phe thr 

1412 

TTA GAT CTG CTT TAT CTC TGT GTG TCT TGC 
leu asp leu leu tyr leu cys val ser cys 

1472 

CGA ATG AAT AGC TTG ACC TTT AAG AAA TCA 
arg met asn ser leu thr phe lys lys ser 

1532 

ATT CTT GGG AGC ATA GGT GCC AGG CAA GAA 
ile leu gly ser ile gly ala arg gin glu 

1592 

AGA AGC CCA TCT GGA AGT GCC TTT GGA AGT 
arg ser pro ser gly ser ala phe gly ser 

1652 

ACT CAC TGG CGT CAA AAC ACA GAG AAG CTT 
thr his trp arg gin asn thr glu lys leu 

1712 

GCA CTG ATT GAT GGA AAC CTG GCT ACA GAA 
ala leu ile asp gly asn leu ala thr glu 

1772 

ATT GTT GTT CAG ACC GTT TCT GTA ACG GAA 
ile val val gin thr val ser val thr glu 

1832 

AAA GTG CTA CTA CAC AGC ATG GCC TGT AAC 
lys val leu leu his ser met ala cys asn 
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1862 1892 

CAA AGT GCA GTT TAT CTA CAA CAC TGT TTT GCT ACA CAG AGA GCC TTG GTT TCA AAG TTT 
gin ser ala val tyr leu gin his cys phe ala thr gin arg ala leu val ser lys phe 

1922 1952 

CCT GAA CTC TTA TTT GAA GAA GAG ACA GAG CAG TGT GCT GAT TTA TGC CTC AGG CTT CTC 
pro glu leu leu phe glu glu glu thr glu gin cys ala asp leu cys leu arg leu leu 

1982 2012 

CGA CAC TGT AGC AGT AGC ATC GGT ACA ATA CGG TCA CAC CCC AGT GCC TCC CTT TAC CTA 
arg his cys ser ser ser ile gly thr ile arg ser his pro ser ala ser leu tyr leu 

2042 2072 

CTA ATG AGG CAA AAC TTT GAG ATT GGG AAT AAC TTT GCC AGG GTT AAA ATG CAG GTA CCA 
leu met arg gin asn phe glu ile gly asn asn phe ala arg val lys met gin val pro 

2102 2132 

ATG TCA CTA TCC TCC TTG GTG GGC ACA TCT CAG AAT TTT AAT GAA GAA TTC TTA AGA CGT 
met ser leu ser ser leu val gly thr ser gin asn phe asn glu glu phe leu arg arg 

2162 2192 

TCT CTA AAG ACT ATA TTG ACA TAT GCT GAA GAA GAT CTG GAA TTG AGG GAA ACA ACA TTT 
ser leu lys thr ile leu thr tyr ala glu glu asp leu glu leu arg glu thr thr phe 

2222 2252 

CCT GAT CAG GTC CAG GAT CTG GTT TTC AAT CTC CAT ATG ATT CTT TCT GAT ACT GTG AAA 
pro asp gin -val gin asp leu val phe asn leu his met ile leu ser asp thr val lys 

2282 2312 

ATG AAG GAA CAC CAG GAG GAT CCT GAA ATG TTG ATT GAT CTA ATG tac aga att gcc aag 
met lys glu his gin glu asp pro glu met leu ile asp leu met tyr arg ile ala lys 

2342 2372 

ggt tac CAG ACC TCT CCA GAT CTG CGA TTG ACC TGG TTG CAG AAC ATG GCA GGC AAG CAC 
gly tyr gin thr ser pro asp leu arg leu thr trp leu gin asn met ala gly lys his 



24 02 I xxxxxxxxxxxxxxxxxx transmembrane domain xxxxxxxxxxxxxxxxxxx 

TCA GAA CGA AGC AAT CAT GCT GAA GCT GCA CAG TGT CTA GTC CAC TCA GCA GCA CTT GTT 
ser glu arg ser asn his ala glu ala ala gin cys leu val his ser ala ala leu val 



xxxxxxxxxxxxxxx I 24 92 

GCT GAA TAT TTG AGC ATG CTG GAG GAC CGG AAA TAT CTT CCT GTG GGA TGT GTA ACA TTT 
ala glu tyr leu ser met leu glu asp arg lys tyr leu pro val gly cys val thr phe 

2522 2552 

CAG AAT ATT TCA TCT AAT GTT TTA GAA GAA TCT GCG GTC TCA GAT GAT GTG GTA TCT CCA 
gin asn ile ser ser asn val leu glu glu ser ala val ser asp asp val val ser pro 

2582 2612 

GAT GAA GAA GGT ATC TGC TCT GGA AAA TAC TTT ACT GAG TCA GGA CTT GTG GGA TTA CTG 
asp glu glu gly ile cys ser gly lys tyr phe thr glu ser gly leu val gly leu leu 

2642 2672 | xxx ITAM xxxx | | xx 

GAA CAA GCA GCT GCT TCC TTC TCT ATG GCT GGC ATG TAT GAA GCA GTT AAT GAA GTT TAC 
glu gin ala ala ala ser phe ser met ala gly met tyr glu ala val asn glu val tyr 

X ITAM xxx I 2732 

AAA GTA CTT ATT CCT ATT CAT GAA GCT AAT CGG GAT GCA AAG AAA CTA TCC ACA ATT CAT 
lys val leu ile pro ile his glu ala asn arg asp ala lys lys leu ser thr ile his 

2762 2792 

GGT AAA CTT CAA GAA GCA TTC AGC AAA ATT GTT CAT CAG AGT ACT GGC TGG GAG CGG ATG 
gly lys leu gin glu ala phe ser lys ile val his gin ser thr gly trp glu arg met 
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2822 |xxxxx ITAM xxxx | 2852 

TTT GGC ACC TAT TTT CGT GTT GGT TTT TAT GGA ACC AAG TTC GGG GAT TTG GAT GAA CAA 
phe gly thr tyr phe arg val gly phe tyr gly thr lys phe gly asp leu asp glu gin 

2882 2912 

GAA TTT GTT TAG AAG GAG CCT GCA ATA ACC AAA CTT GCA GAG ATA TCT CAC AGA TTG GAG 
glu phe val tyr lys glu pro ala ile thr lys leu ala glu ile ser his arg leu glu 

2942 2972 

GGA TTT TAG GGA GAA AGA TTT GGA GAG GAT GTG GTT GAA GTA ATC AAA GAC TCT AAT CCT 
gly phe tyr gly glu arg phe gly glu asp val val glu val ile lys asp ser asn pro 

3002 3032 

GTA GAC AAG TGT AAA TTA GAT CCT AAC AAG GCA TAT ATT CAG ATT ACC TAT GTG GAG CCA 
val asp lys cys lys leu asp pro asn lys ala tyr ile gin ile thr tyr val glu pro 

3062 3092 

TAG TTT GAC ACA TAT GAG ATG AAG GAC AGA ATC ACC TAT TTC GAC AAA AAT TAC AAT CTT 
tyr phe asp thr tyr glu met lys asp arg ile thr tyr phe asp lys asn tyr asn leu 

3122 3152 

CGT CGA TTC ATG TAC TGT ACA CCC TTT ACT TTA GAT GGC CGT GCC CAT GGG GAA CTT CAT 
arg arg phe met tyr cys thr pro phe thr leu asp gly arg ala his gly glu leu his 

3182 3212 

GAA CAA TTC AAA AGG AAG ACC ATT CTG ACT ACG TCT CAT GCC TTT CCT TAT ATT AAA ACA 
glu gin phe lys arg lys thr ile leu thr thr ser his ala phe pro tyr ile lys thr 



■^^'^2 3272 I xxxxxxxxxxxxxxxxxxxxxxx 

AGG GTC AAT GTC ACT CAT AAA GAA GAG ATC ATC TTA ACA CCA ATT GAA GTT GCT ATT GAG 
arg val asn val thr his lys glu glu ile ile leu thr pro ile glu val ala ile glu 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxx Coiled coil -1 xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
GAC ATG CAG AAA AAG ACA CAG GAG TTG GCA TTT GCA ACA CAT CAG GAT CCC GCA GAC CCC 
asp met gin lys lys thr gin glu leu ala phe ala thr his gin asp pro ala asp pro 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx I 33 92 

AAA ATG CTT CAG ATG GTA CTC CAG GGA TCT GTA GGC ACC ACA GTG AAT CAG GGG CCT TTG 
lys met leu gin met val leu gin gly ser val gly thr thr val asn gin gly pro leu 

3422 3452 

GAA GTT GCC CAG GTT TTT CTG TCT GAA ATA CCT AGT GAC CCA AAG CTC TTC AGA CAT CAT 
glu val ala gin val phe leu ser glu ile pro ser asp pro lys leu phe arg his his 

■^^^2 3512 I xxxxxxxxxxx 

AAT AAA CTG CGA CTC TGC TTT AAA GAT TTT ACT AAA AGG TGT GAA GAT GCC TTA AGA AAA 
asn lys leu arg leu cys phe lys asp phe thr lys arg cys glu asp ala leu arg lys 

xxxxxxxxxxxxxxxxxxxxxx Coiled coil 2 xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx | 

AAT AAG AGC TTA ATT GGG CCG GTT CAA AAG GAG TAT CAA AGG GAA TTG GGG AAA CTA TCT 

asn lys ser leu ile gly pro val gin lys glu tyr gin arg glu leu gly lys leu ser 

3602 3632 

TCG CCT TAA AGA GGC CCT ACA GCC CTA GAT CAC AGA AAG TCC CTC AGT TAT CCA AGC CAG 
ser pro STP 

3662 3692 

TAT TGC TTG TCC CCT GCC ACA GAG ATT CCT TCA GTC GAA TGA GCT TTC GCA AAA TGG ATC 

3722 3752 

TCT AAA CTG AAT GCA CTT GTT TTA TTC ATC TGC AAA GAG CCA TGT ATT CAA CAT CGA GTG 

3782 3812 

TGA AAA GAT CTA TTG GAA ACC AAC ATG GAA TGG AAT TCT GGA AAT TAT TAT TCA TTG AAG 
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3842 

AAT GCA GTG GCC 
3902 

GTT TCT AAT GTT 
3962 

CAC TAT ACA TTT 
4022 

TCG TAT TTT TTA 
4082 

TAA TGC TTA TTT 



AAG AAA ATA TCA 
CGG GTA ACA AGC 
ACC ATT ATT TAT 
ATT TAT ATC ACC 
TAA AGG TAG TAA 



3872 

AAT GTA GAT TGT 
3932 

TGT TAT CTT TTA 
3992 

ACC ATA GCT AAG 

4052 
ATT TAT AGA TTC 

4112 
AAA ATA TGT GAA 



TAA CGC TTG AGA 
AGA CAT TTT AAT 
GTT AAA AAT TTA 
ATT TTG GAC CCA 
TGT TTA CCT CGT 



ATC ATG GCT ATG 
GAC TCA AAG GTA 
TTC ACT TTA AGT 
TTT TAA ATG TAG 
GCG CGC CAG GGC 



4142 
CTC 
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ASGNLDKNARFSAIYRQDSNKLSNDDMLKLLADFRKPEKMAKLPVILGNLDITIDNVSSD 



FPNYVNSSYIPTKQFETCSKTPITFEVEEFVPCIPKHTQPYTIYTNHLYVYPKYLKYDSQ 



^j^j^jjjjQ^pgp I K 

KSFAKARNIAICIEFKDSDEEDSQPLKCIYGRPGGPVFTRSAFAAVLHHHQNPEFYDEIK 



lELPTQLHEKHHLLLTFFHVSCDNSSKGSTKKRDWETQVGYSWLPLLKDGRWTSEQHI 
lELPTQLHEKHHLLLTFFHVSCDNSSKGSTKKRDWETQVGYSWLPLLKDGRWTSEQHI 



PVSANLPSGYLGYQELGMGRHYGPEIKWVDGGKPLLKISTHLVSTVYTQDQHLHNFFQYC 
PVSANLPSGYLGYQELGMGRHYGPEIKWVDGGKPLLKISTHLVSTVYTQDQHLHNFFQYC 



QKTESGAQALGNELVKYLKSLHAMEGHVMIAFLPTILNQLFRVLT-RATQEEVAVNVTRV 
QKTESGAQALGNELVKYLKSLHAMEGHVMIAFLPTILNQLFRVLT-RATQEEVAVNVTRV 



MEIQVLIRFLSVILMQLFWVLPNMIHEDDVPISCPMV 

MSFLPIILNQLFKVLV-QNEEDEITTTVTRV 

NRSRSLSNSNPDISGTPTSPDDEVRSIIGSKGLDRSNSWVNTGGPKAAPWGSNPSPSAES 



IIHWAQCHEEGLESHLRSYVKYAYKAEPYVASEYKTVHEELTKSMTTILKPSADFLTSN 
IIHWAQCHEEGLESHIiRSYVKYAYKAEPYVASEYKTVHEELTKSMTTILKPSADFLTSN 



LFHIVSKCHEEGLDSYLSSFIKYSFRPGKPSAPQAPLIHETLATMMIALLKQSADFLAIN 

LPDIVAKCHEEQLDHSVQSYIKFVFKTR ACKERPVHEDLAKNVTGLLK- SNDSPTVK 

TQAMDRSCNRMSSHTETSSFLQTLTGRLP TKKLFHEELALQWWCSG- -SVR E 
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Cadherin 
Cl eavag e 



KLLRYSWFFFDVLIKSMAQHLIENSKVKLL 
KLLKYS WFFFD VL I KSMAQHL I ENS KVKLL 


^qr: 


r'pasyhhaaetwnmlmphitqkfgd 
^pasyhhavetwnmlmphitqkfrd 


KLLKYSWFFFEIIAKSMATYLLEENKIKLT 
HVLKHSWFFFAIILKSMAQHLIDTNKIQLP 
SALQQAWFFFELMVKSMVHHLYFNDKLEAP 


^gqr: 
^pqr: 
^ksr 


^PKAYHHALHSLFLAIT- IVESQYAE 
^ PE S YQNE LDNL VMVLS DHV I WKYKD 
J" PERFMDD I AALVS T I AS D I VS RFQK 



NPEASKNANHSLAVFIKRCFTFMDRGFVFKQIN NYIS--CFAPGDPKTLFEYKFEFL 

NPEASKNANHSLAVFIKRCFTFMDRGFVFKQIN NYIS - - CFAPGDPKTLFEYKFEFL 



IPKESRNVNYSLASFLKCCLTLMDRGFVFNLIN DYIS- -GFSPKDPKVLAEYKFEFL 

ALEETRRATHSVARFLKRCFTFMDRGCVFKMVN NYIS- -MFSSGDLKTLCQYKFDFL 

DTEMVERLNTSLAFFLNDLLSVMDRGFVFSLIKSCYKQVSSKLYSLPNPSVLVSLRLDFL 



RWCNHEHY I PLNLPM PFGKGRIQR YQDLQL D YSLTDEF 

RWCNHEHYIPLNLPM PFGKGRIQR YQDLQL DYSLTDEF 



QT I CNHEH Y IPLNLPM AFAKP KLQR VQDSNL EYSLSDEY 

QEVCQHEHFIPLCLPIRSANIPDPLTPSES TQELHASDMPEYSVTNEF 

RIICSHEHYVTLNLPCSLLTPPASPSPSVSSATSQSSGFSTNVQDQKIANMFELS- -VPF 
MNADTAPTSPCPSIS SQNSSSCSSFQDQKIASMFDRTSRVPA 

Cadherin 



EC motif 



CRNHFLVGL 
CRNHFLVGL 


LLRE 
LLRE 


VGTALQEFRE VRLIAISVLKNLLIKHSFDDRYASRSHQARIAT 

VGTALQE FRE VRL I A I S VL KNLL I KHS FDDR YASRSHQAR I AT 


CKHHFLVGL 
CRKHFLIGI 
RQQHYLAGL 
SSTS-SPGL 


LLRE 
LLRE 
VLTV 
LFTE 


TS I ALQDNYE 1 R YTA I S VI KNLL I KHAFDTRYQHKNQQAKI AQ 

VGFALQEDQD VRHLALAVLKNLMAKHS FDDRYREPRKQAQI AS 

LAVILDPDAEGLFGLHKKVINMVHNLLSSHDSDPRYSDPQIKARVAM 
LAAALDAEGEGISEVQRKAVSAIHSLLSSHDLDPRCVKPEVKVKIAA 



LYLPLFGLLIENVQRINVRDVSPFPVNAG-MTVKDESLALPAVNPLVTPQKGSTLDNSLH 
LYLPLFGLLIENVQRINVRDVSPFPVNAG-MTVKDESLALPAVNPLVTPQKGSTLDNSLH 



LYLPFVGLLLENIQRLAGRDTLYSCAAMPNSASRDEFPCG FTSP- -AN- -RGSLS 

LYMPLYGMLLDNMPRIYLKDLYPFTVNTSNQGSRDDLSTNGGFQSQTAIKHANSVDTSFS 

LYLPLIGIIMETVPQLYDFTETHNQRGRPICIATDDYESE SG SMIS 

LYLPLVGIILDALPQLCDFTVADTRRYR TSGSDEEQE GA GAIT 



KDLLGAISGIASPYTTSTPNINSVRNADSRGSLISTDSGNSLPERNSEKSNSLDKHQQSS 
KDLLGAISGIASPYTTSTPNINSVRNADSRGSLISTDSGNSLPERNSEKSNSLDKHQQSS 



TDKDTAYGSFQNG HGIKREDSRGSLIP-EGATGFPDQGNTGEN TRQS 

KDVLNSIAAFSS lAISTVNHADSRASLASLDSNPSTNEKSSEKTDNCEKIPRPL 

QTVAMAI AGTS VPQ LTRPGS FLLTSTSGRQHT 

QNVALAIAGNNFN LKTSG- IVLSSLPYKQYN 



TLGNSWRCDKLDQSEIKSLLMCFLYILKSMSDDALFTYWN-KASTSELMDFFTISEVCL 
TLGNSWRCDKLDQSEIKSLLMCFLYILKSMSDDALFTYWN-KASTSELMDFFTISEVCL 



STRSSVSQYNRLDQYEIRSLLMCYLYIVKMISEDTLLTYWN-KVSPQELINILILLEVCL 
ALIGSTLRFDRLDQAETRSLLMCFLHIMKTISYETLIAYWQ-RAPSPEVSDFFSILDVCL 

TFSAESSRSLLICLLWVLKN-ADETVLQKWFTDLSVLQLNRLLDLLYLCV 

MLNADTTRNLMICFLWIMKN-ADQSLIRKWIADLPSTQLNRILDLLFICV 
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HQFQYMGKRYIARNQEGLG- - PIVHDRKS QTLPVSRNRTGMM 

HQFQYMGKRYI AR TGMM 

FHFRYMGKRNIARVHDAWLSKHFGIDRKS QTMPALRNRSGVM 

QNFRYLGKRNIIRKIAAAF - - KFVQSTQNNGTLKGSNPSCQTSGLLAQWMHSTSRHEGHK 

SCFEYKGKKVFERMNSLTFK--KSKDMRAK LEEAILGSIGARQEMV 

LCFEYKGKQSSDKVSTQVLQ- - KSRDVKAR LEEALLRGEGARGEMM 



HARLQQL GSLDNS LTFNHSYGHSDADVLHQSLLEANIATEVC 

HARLQQL GSLDNS LTFNHSYGHSDADVLHQSLLEANIATEVC 

QARLQHL SSLESS FTLNHSSTTTEADIFHQALLEGNTATEVS 

QHRSQTLPIIRGK NALSNPKL LQMLDNTMTSNSNEIDIVHHVDTEANIATEGC 

RRSRGQLERSPSGSAFGSQENLRWRKDMTHWRQNTEKLDKSRAEIEHEALIDGNLATEAN 
RRRAPGNDRFP GLNENLRWKKEQTHWRQANEKLDKTKAELDQEALISGNLATEAH 



LTALDTLSLFTLAFKNQLLADHGHNPLMKKVFDVYLCFLQKHQSETALKNVFTALRSLIY 
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1 

A 



2 32 

CGA GTA AAT CGT TCT CGA AGC CTT AGT AAT AGC AAT CCA GAT ATA TCT GGG ACT CCC ACG 
62 92 

TCA CCA GAT GAT GAA GTT CGA TCA ATC ATC GGG AGT AAG GGT TTA GAT CGC TCC AAT TCC 
122 152 

TGG GTT AAC ACT GGT GGT CCA AAA GCT GCC CCA TGG GGA TCC AAC CCC AGT CCA AGT GCA 
182 212 

GAA TCA ACA CAG GCT ATG GAT CGA AGT TGT AAT CGT ATG TCT TCG CAC ACA GAG ACG TCA 

met asp arg ser cys asn arg met ser ser his thr glu thr ser 

242 272 

AGT TTC TTA CAA ACA TTA ACG GGA CGC TTA CCA ACT AAA AAG CTT TTT CAC GAG GAG CTG 
ser phe leu gin thr leu thr gly arg leu pro thr lys lys leu phe his glu glu leu 

302 332 

GCT TTG CAG TGG GTT GTT TGC AGT GGC AGC GTT CGG GAA TCA GCT TTG CAA CAA GCC TGG 
ala leu gin trp val val cys ser gly ser val arg glu ser ala leu gin gin ala trp 

ref 1.1 and 1.2 I 
362 ref 2.1 and 2.2^ 392 

TTC TTT TTT GAA TTA ATG GTA AAG AGC ATG GTG CAC CAT TTA TAC TTT AAT GAT AAA CTT 
phe phe phe glu leu met val lys ser met val his his leu tyr phe asn asp lys leu 



422 452 

GAG GCT CCA AGG AAA AGT CGT TTT CCA GAA CGT TTC ATG GAT GAC ATT GCA GCT CTT GTC 
glu ala pro arg lys ser arg phe pro glu arg phe met asp asp ile ala ala leu val 

482 512 

AGC ACG ATT GCT AGT GAT ATA GTT TCA CGA TTT CAG AAG GAC ACA GAA ATG GTT GAG AGA 
ser thr ile ala ser asp ile val ser arg phe gin lys asp thr glu met val glu arg 

542 572 

CTC AAT ACA AGC CTT GCA TTC TTT CTC AAT GAT CTG TTG TCT GTT ATG GAC AGA GGA TTT 
leu asn thr ser leu ala phe phe leu asn asp leu leu ser val met asp arg gly phe 

602 632 

GTT TTT AGC CTT ATA AAG TCC TGC TAT AAA CAG GTG TCT TCA AAG CTT TAC TCA TTA CCG 
val phe ser leu ile lys ser cys tyr lys gin val ser ser lys leu tyr ser leu pro 

■ref 3 . 1 and 3 . 2 
662 T, 692 

AAT CCC AGT GTT CTG^GTG TCC TTG AGG CTG GAT TTT CTA CGA ATC ATC TGC AGT CAT GAG 
asn pro ser val leu val ser leu arg leu asp phe leu arg ile ile cys ser his glu 



722 752 

CAC TAT GTT ACA TTA AAC TTA CCC TGC AGC TTA CTT ACT CCA CCT GCA TCT CCA TCA CCT 
his tyr val thr leu asn leu pro cys ser leu leu thr pro pro ala ser pro ser pro 

ref 4 . 1 and 4 . 2 I 
782 ^ 815 

TCT GTT TCT TCT GCA ACA TCT CAG AGT TCT GGA TTT TCT ACG AAT GTA CAA GAC CAA AAG 
ser val ser ser ala thr ser gin ser ser gly phe ser thr asn val gin asp gin lys 



842 872 

ATT GCA AAT ATG TTT GAA TTA TCC GTG CCT TTC CGC CAA CAG CAT TAT TTG GCA GGA CTT 
ile ala asn met phe glu leu ser val pro phe arg gin gin his tyr leu ala gly leu 
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902 

GTG TTA ACA GAG 
val leu thr glu 

962 

AAG AAA GTC ATC 
lys lys val ile 

1022 

TCT GAG OCT GAG 
ser asp pro gin 

1082 

ATG GAA ACT GTA 
met glu thr val 

1142 

ATT TGT ATA GCC 
ile cys ile ala 



CTG GOT GTC ATT 
leu ala val ile 

AAT ATG GTA CAC 
asn met val his 



ATA AAG GCT CGA 
ile lys ala arg 



CCT CAG CTG TAT 
pro gin leu tyr 



ACT GAT GAT TAT 
thr asp asp tyr 



932 

TTA gac cct gat 
leu asp pro asp 

992 

AAT TTA CTC TCC 
asn leu leu ser 

1052 

GTG GCC ATG TTG 
val ala met leu 

1112 

GAT TTT ACA GAA 
asp phe thr glu 

1172 

GAA AGT GAG AGC 
glu ser glu ser 



get gaa gga ctg 
ala glu gly leu 



AGT CAC GAC TCA 
ser his asp ser 



TAT CTA CCT CTG 
tyr leu pro leu 



ACT CAC AAT CAA 
thr his asn gin 



GGA AGT ATG ATA 
gly ser met ile 



TTT GGA TTG CAT 
phe gly leu his 



GAC CCG CGG TAG 
asp pro arg tyr 



ATT GGT ATT ATC 
ile gly ile ile 



CGA GGA AGA CCA 
arg gly arg pro 



AGC CAG ACC GTT 
ser gin thr val 



1202 

GCC ATG GCA ATC GCA GGG ACA TCG GTC CCT 
ala met ala ile ala gly thr ser val pro 

I ref 5 . 1 and 5 . 2 
1262 ^ 

ACG TCA ACG AGT GGC AGG CAA CAC ACT ACC 
thr ser thr ser gly arg gin his thr thr 



1232 

CAA CTA ACA AGG CCT GGC AGT TTC CTC CTC 
gin leu thr arg pro gly ser phe leu leu 

1292 

TTT TCA GCA GAA TCA AGT CGA AGC CTT TTG 
phe ser ala glu ser ser arg ser leu leu 



1322 

ATC TGT CTA CTT TGG GTT CTC AAA AAT GCA 
ile cys leu leu trp val leu lys asn ala 

1382 

GAT CTC TCA GTC TTG CAG CTA AAC CGG CTA 
asp leu ser val leu gin leu asn arg leu 



1352 

GAT GAA ACA GTT CTA CAG AAG TGG TTT ACA 
asp glu thr val leu gin lys trp phe thr 

1412 

TTA GAT CTG CTT TAT CTC TGT GTG TCT TGC 
leu asp leu leu tyr leu cys val ser cys 



1442 

TTT GAG TAT AAA GGG 
phe glu tyr lys gly 

1502 

AAA GAC ATG AGA GCA 
lys asp met arg ala 

1562 

ATG GTA CGG CGA AGC 
met val arg arg ser 



AAA AAA GTG TTT GAA 
lys lys val phe glu 

AAG CTT GAA GAA GCT 
lys leu glu glu ala 

I 

CGA GGA CAG CTC GAG 
arg gly gin leu glu 



1472 

CGA ATG AAT AGC TTG 
arg met asn ser leu 

1532 

ATT CTT GGG AGC ATA 
ile leu gly ser ile 
f 6.1 and 6 . 2 
1592 

AGA AGC CCA TCT GGA 
arg ser pro ser gly 



ACC TTT AAG AAA TCA 
thr phe lys lys ser 



GGT GCC AGG CAA GAA 
gly ala arg gin glu 



AGT GCC TTT GGA AGT 
ser ala phe gly ser 



1622 1652 

CAA GAA AAT TTG AGG TGG AGG AAA GAT ATG ACT CAC TGG CGT CAA AAC ACA GAG AAG CTT 
gin glu asn leu arg trp arg lys asp met thr his trp arg gin asn thr glu lys leu 



1682 1712 

GAC AAA TCA AGA GCA GAG ATT GAA CAC GAA GCA CTG ATT GAT GGA AAC CTG GCT ACA GAA 
asp lys ser arg ala glu ile glu his glu ala leu ile asp gly asn leu ala thr glu 



FIG. 4A (2 of 14) 



1742 

GCA AAC CTA ATC ATT TTA GAT ACA TTA GAG 
ala asn leu ile ile leu asp thr leu glu 

1802 

TCC AAA GAG AGC ATT CTT GGT GGA GTG CTA 
ser lys glu ser ile leu gly gly val leu 

1862 

CAA AGT GCA GTT TAT CTA CAA CAC TGT TTT 
gin ser ala val tyr leu gin his cys phe 

1922 

CCT GAA CTC TTA TTT GAA GAA GAG ACA GAG 
pro glu leu leu phe glu glu glu thr glu 

1982 

CGA CAC TGT AGC AGT AGC ATC GGT ACA ATA 
arg his cys ser ser ser ile gly thr ile 

2042 

CTA ATG AGG CAA AAC TTT GAG ATT GGG AAT 
leu met arg gin asn phe glu ile gly asn 

2102 

ATG TCA CTA TCC TCC TTG GTG GGC ACA TCT 
met ser leu ser ser leu val gly thr ser 

2162 

TCT CTA AAG ACT ATA TTG ACA TAT GCT GAA 
ser leu lys thr ile leu thr tyr ala glu 

2222 

CCT GAT CAG GTC CAG GAT CTG GTT TTC AAT 
pro asp gin val gin asp leu val phe asn 

2282 

ATG AAG GAA CAC CAG GAG GAT CCT GAA ATG 
met lys glu his gin glu asp pro glu met 

2342 

ggt tac CAG ACC TCT CCA GAT CTG CGA TTG 
gly tyr gin thr ser pro asp leu arg leu 

2402 

TCA GAA CGA AGC AAT CAT GCT GAA GCT GCA 
ser glu arg ser asn his ala glu ala ala 

2462 

GCT GAA TAT TTG AGC ATG CTG GAG GAC CGG 
ala glu tyr leu ser met leu glu asp arg 
I ref 8 - 1 and 8 . 2 
>|r 2525 

CAG AAT ATT TCA TCT AAT GTT TTA GAA GAA 
gin asn ile ser ser asn val leu glu glu 

2582 

GAT GAA GAA GGT ATC TGC TCT GGA AAA TAC 
asp glu glu gly ile cys ser gly lys tyr 



1772 

ATT GTT GTT CAG ACC GTT TCT GTA ACG GAA 
ile val val gin thr val ser val thr glu 

1832 

AAA GTG CTA CTA CAC AGC ATG GCC TGT AAC 
lys val leu leu his ser met ala cys asn 

ref 7.1 and 7.2 I 
1892 ^ 
GCT ACA CAG AGA GCC TTG GTT TCA AAG TTT 
ala thr gin arg ala leu val ser lys phe 

1952 

CAG TGT GCT GAT TTA TGC CTC AGG CTT CTC 
gin cys ala asp leu cys leu arg leu leu 

2012 

CGG TCA CAC CCC AGT GCC TCC CTT TAC CTA 
arg ser his pro ser ala ser leu tyr leu 

2072 

AAC TTT GCC AGG GTT AAA ATG CAG GTA CCA 
asn phe ala arg val lys met gin val pro 

2132 

CAG AAT TTT AAT GAA GAA TTC TTA AGA CGT 
gin asn phe asn glu glu phe leu arg arg 

2192 

GAA GAT CTG GAA TTG AGG GAA ACA ACA TTT 
glu asp leu glu leu arg glu thr thr phe 

2252 

CTC CAT ATG ATT CTT TCT GAT ACT GTG AAA 
leu his met ile leu ser asp thr val lys 

2312 

TTG ATT GAT CTA ATG tac aga att gcc aag 
leu ile asp leu met tyr arg ile ala lys 

2372 

ACC TGG TTG CAG AAC ATG GCA GGC AAG CAC 
thr trp leu gin asn met ala gly lys his 

2432 

CAG TGT CTA GTC CAC TCA GCA GCA CTT GTT 
gin cys leu val his ser ala ala leu val 

2492 

AAA TAT CTT CCT GTG GGA TGT GTA ACA TTT 
lys tyr leu pro val gly cys val thr phe 

2552 

TCT GCG GTC TCA GAT GAT GTG GTA TCT CCA 
ser ala val ser asp asp val val ser pro 

2612 

TTT ACT GAG TCA GGA CTT GTG GGA TTA CTG 
phe thr glu ser gly leu val gly leu leu 
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2642 

GAA CAA GCA GCT 
glu gin ala ala 

2702 

AAA GTA CTT ATT 
lys val leu ile 

2762 

GGT AAA CTT CAA 
gly lys leu gin 

2822 

TTT GGC ACC TAT 
phe gly thr tyr 

2882 

GAA TTT GTT TAC 
glu phe val tyr 
I ref 10 . 1 and 
^ 2945 
GGA TTT TAC GGA 
gly phe tyr gly 

3002 

GTA GAC AAG TGT 
val asp lys cys 

3062 

TAC TTT GAC ACA 
tyr phe asp thr 

3122 

CGT CGA TTC ATG 
arg arg phe met 

3182 

GAA CAA TTC AAA 
glu gin phe lys 

3242 

AGG GTC AAT GTC 
arg val asn val 

3302 

GAC ATG CAG AAA 
asp met gin lys 

3362 

AAA ATG CTT CAG 
lys met leu gin 

3422 

GAA GTT GCC CAG 
glu val ala gin 

3482 

AAT AAA CTG CGA 
asn lys leu arg 



GCT TCC TTC TCT 
ala ser phe ser 



CCT ATT CAT GAA 
pro ile his glu 

GAA GCA TTC AGC 
glu ala phe ser 

TTT CGT GTT GGT 
phe arg val gly 

AAG GAG CCT GCA 
lys glu pro ala 
10 .2 

GAA AGA TTT GGA 
glu arg phe gly 

AAA TTA GAT CCT 
lys leu asp pro 

TAT GAG ATG AAG 
tyr glu met lys 



TAC TGT ACA CCC 
tyr cys thr pro 



AGG AAG ACC ATT 
arg lys thr ile 



ACT CAT AAA GAA 
thr his lys glu 



AAG ACA CAG GAG 
lys thr gin glu 

ATG GTA CTC CAG 
met val leu gin 



GTT TTT CTG TCT 
val phe leu ser 



CTC TGC TTT AAA 
leu cys phe lys 



2672 

ATG GCT GGC ATG 
met ala gly met 

2732 

GCT AAT CGG GAT 
ala asn arg asp 

2792 

AAA ATT GTT CAT 
lys ile val his 

2852 

TTT TAT GGA ACC 
phe tyr gly thr 

2912 

ATA ACC AAA CTT 
ile thr lys leu 

2972 

GAG GAT GTG GTT 
glu asp val val 

3032 

AAC AAG GCA TAT 
asn lys ala tyr 

3092 

GAC AGA ATC ACC 
asp arg ile thr 

3152 

TTT ACT TTA GAT 
phe thr leu asp 

3212 
CTG ACT ACG TCT 
leu thr thr ser 
I ref 11.1 
^ 3272 
GAG ATC ATC TTA 
glu ile ile leu 

3332 

TTG GCA TTT GCA 
leu ala phe ala 

3392 

GGA TCT GTA GGC 
gly ser val gly 

3452 

GAA ATA CCT AGT 
glu ile pro ser 

3512 

GAT TTT ACT AAA 
asp phe thr lys 



TAT GAA GCA GTT 
tyr glu ala val 

GCA AAG AAA CTA 
ala lys lys leu 
^ref 9.1 

CAG AGT ACT GGC 
gin ser thr gly 

AAG TTC GGG GAT 
lys phe gly asp 

GCA GAG ATA TCT 
ala glu ile ser 



GAA GTA ATC AAA 
glu val ile lys 



ATT CAG ATT ACC 
ile gin ile thr 



TAT TTC GAC AAA 
tyr phe asp lys 

GGC CGT GCC CAT 
gly arg ala his 

CAT GCC TTT CCT 
his ala phe pro 



ACA CCA ATT GAA 
thr pro ile glu 

ACA CAT CAG GAT 
thr his gin asp 

ACC ACA GTG AAT 
thr thr val asn 



GAC CCA AAG CTC 
asp pro lys leu 

AGG TGT GAA GAT 
arg cys glu asp 



AAT GAA GTT TAC 
asn glu val tyr 



TCC ACA ATT CAT 
ser thr ile his 



TGG GAG CGG ATG 
trp glu arg met 

TTG GAT GAA CAA 
leu asp glu gin 

CAC AGA TTG GAG 
his arg leu glu 

GAC TCT AAT CCT 
asp ser asn pro 



TAT GTG GAG CCA 
tyr val glu pro 



AAT TAC AAT CTT 
asn tyr asn leu 

GGG GAA CTT CAT 
gly glu leu his 



TAT ATT AAA ACA 
tyr ile lys thr 



GTT GCT ATT GAG 
val ala ile glu 

CCC GCA GAC CCC 
pro ala asp pro 

CAG GGG CCT TTG 
gin gly pro leu 



TTC AGA CAT CAT 
phe arg his his 



GCC TTA AGA AAA 
ala leu arg lys 
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3542 

AAT AAG AGC TTA ATT GGG CCG GTT CAA AAG 
asn lys ser leu ile gly pro val gin lys 

3602 

TCG CCT TAA AGA GGC CCT ACA GCC CTA GAT 
ser pro OCH 

3662 

TAT TGC TTG TCC CCT GCC ACA GAG ATT CCT 
3722 

TCT AAA CTG AAT GCA CTT GTT TTA TTC ATC 
3782 

TGA AAA GAT CTA TTG GAA ACC AAC ATG GAA 
3842 

AAT GCA GTG GCC AAG AAA ATA TCA AAT GTA 
3902 

GTT TCT AAT GTT CGG GTA ACA AGC TGT TAT 
3962 

CAC TAT ACA TTT ACC ATT ATT TAT ACC ATA 
4022 

TCG TAT TTT TTA ATT TAT ATC ACC ATT TAT 
4082 

TAA TGC TTA TTT TAA AGG TAC TAA AAA ATA 

4142 
CTC 



3572 

GAG TAT CAA AGG GAA TTG GGG AAA CTA TCT 
glu tyr gin arg glu leu gly lys leu ser 

3632 

CAC AGA AAG TCC CTC AGT TAT CCA AGC CAG 



3692 

TCA GTC GAA TGA GCT TTC GCA AAA TGG ATC 
3752 

TGC AAA GAG CCA TGT ATT CAA CAT CGA GTG 
3812 

TGG AAT TCT GGA AAT TAT TAT TCA TTG AAG 
3872 

GAT TGT TAA CGC TTG AGA ATC ATG GCT ATG 
3932 

CTT TTA AGA CAT TTT AAT GAC TCA AAG GTA 
3992 

GCT AAG GTT AAA AAT TTA TTC ACT TTA AGT 
4052 

AGA TTC ATT TTG GAC CCA TTT TAA ATG TAG 

I ref 12.1 
4112 ^ 

TGT GAA TGT TTA CCT CGT GCG CGC CAG GGC 
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Ref 1 . 1 

Sequence of BAC8 using primer C3S3, which spans nucleotides 
341-360 of the cDNA. Exon sequence is underlined and 
represents nucleotides 365-381. 

TTTTTTGAATTAATGGT GAGCAAAAACTGAGCATGTTCTTTAATATTTTTTCTCTTAGTG 

AACAATTTTATGCTAGCTCATTTGTTACCTTAGAAATCTTTTTCTGTTGCACATCTTAAC 

GCTTTTCCATGTGCCTCTAAGACAAAATTACATGTGTTACATCTCTAAATAAACACTGT 

GGACACTCAACACAGTTTAGGTGGAATTAAGAGTGAGGCTCATTTTAACTCTTATTTTC 

TCAGGGATGGTTGCATAAGCTAGCTATATTTTCAAAGGAAACTTGTGATACATTCTTTG 

CTAGTCATTATACATGAAGTGTATAATGACAGTATTGTAGATTTTATACCAAAGATGG 

AAAGAGCTTTATAGATACCCACTGCTATTGTTATGGCTAGTAAACCCTTAGGGAAATG 

CCAGTTACAATCAATAAAAAAACAACAGTCTGGCTGGGTGCAGTGGCTCACACCTGTA 

ATCTCAGCACTTTAGAAGGCCGAGGCAGGAGGATC ACTTGAGATCAGGAGTTTGAGAC 

CCAGCCTGGGCAACATAGCAAGAGCCCATATNTACCCAAAAAAAATTTTTTTTTAAAT 

TAAGCTAAAACCCTGGNNGGCCACAAAACCTGTAGTTCCCATCTACTTTGGAAAGGCT 

TGAAGGANGGGAGGGCTTGCTTTGAGCCCCAAGAANGTTCAAAGGCTNGCNGNCAGG 

TTNTGATTCNACACNTGCAACTCCCGCATTGGGTNAACAAAANCCAAGGAANC. 

Ref 1.2 

Sequence of BAC9 using primer C3S3, which spans nucleotides 
341-360 of the cDNA. Exon sequence is underlined and 
represents nucleotides 372-381. 

AATTAATGGT GAGCAAAAACTGAGCATGTTCTTTAATATTTTTTCTCTTAGTGAACAAT 

TTTATGCTAGCTCATTTGTTACCTTAGAAATCTTTTTCTGTTGCACATCTTAACGCTTTT 

CCATGTGCCTCTAAGACAAAATTACATGTGTTACATCTCTAAATAAACACTGTGGACA 

CTCAACACAGTTTAGGTGGAATTAAGAGTGAGGCTCATTTTAACTCTTATTTTCTCAGG 

GATGGTTGCATAAGCTAGCTATATTTTCAAAGGAAACTTGTGATACATTCTTTGCTAGT 

CATTATACATGAAGTGTATAATGACAGTATTGTAGATTTTATACCAAAGATGGAAAGA 

GCTTTATAGATACCCACTGCTATTGTTNTGGCTAGTAAACCCTTANGGAAATGCCAGTT 

NCAATCAATAAAAAAACAACAGTACTGGCTGGGTGCAGTGGCTTACACCTGTAATCTC 

AGCACTTTATAAGGCCCNAGGCNGGAGGATCACTTNAGATCCAGGAGTTTGAGACCAG 

CCTGGGCAACATANCAAGAGCCCATTATCTACCAAAAAANTTTTTTTTTTAAAATTAAG 

CTAAACNCTGGGTGGNACAAACCTGTTNGNTTCCNATNTNCCTTTGGAAAAGCTTANG 

AAGGGGAGGGCTTNCTTTGGANCCCCAAAAAGTTNAAAGGGNTTGCAGTCAGCCTTTT 

NAATCACCCNNNGGNCCTNTCGCATTGGGATTNCCAANANGCCAANGNAACCCCGNT 

CNTNTTTTAAAAAANTNTTTTAAAGNANNTTTNTTNGN 

Ref 2.1 

Sequence of BAC8 using primer HC3AS7, which spans 

nucleotides 387-406 of the cDNA. Exon sequence is 

underlined and represents nucleotides 382-386. 

lOTITlSnsrGAAAAAGATTANATTATTAATTCTATGATATATTAACAATACACANCTC 

CACTTGGACTATTTTTAAAATATGGCATGTAATTTAATAGATGACTGAAATATTTTAGC 

TTCTCAAATATTTTTTAAAGTTCCCTACAATGTTTTGTATTTGCTTAAAATAAAATANA 

AAAACCACCATATTACTTTCAGAAAATTATGCTAGCTAACAATAGGACAAAAAATTCT 

GTGTATGTCAACAAAAAAAATTCAACCTTAAATTTTTTTTTTTCCATAAAAAACAGGGC 

TACTTGCCCAGGTGAGANGTGCTGCCGTATGAGCTCCTCGNTAGATTGCGCNGCCGGA 

NTGTCGGNCCCTNCGTTTAATATAACGGCGTGNGCNTGTACCGCAGGCTNTGCTAGGT 

CGTGNTCCCAAGATATCNTNTNTANCATANTAGACGNTGGNGNCGNNTGCATGTGGCN 

TNATTNTNGCAATTGTNACAATCCTAGTNTGTACNTNANAGNTCNGCCNCTGTGANNT 

CGTTGTATAGTCNGNGGCNCGCTTGNTTCTGATGCTGAGAGCANTNNCNNACTNTTNN 

NCNCCCT^TCTTTNCNNTTlSnsrNlSrNCCCCCNTTT]^ 

NTNAANNNACCNCC 
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Ref 2.2 

Sequence of BAC9 using primer HC3AS7, which spans nucleotides 
387-406 of the cDNA. Exon sequence is not found within this 
sequence. Since the primer is directed against exon sequence 
we presume that sequence derived from HC3AS7 is intron 
sequence. Additionally, this sequnce matches the intron 
sequence found in the previous sequence (BAC8 sequenced with 
HC3AS7) . 

GCGCTNCCNNNTNNTTTATCTTCTGAAAAGACTNATATNATTCTATGATTATAACATTA 

CACACTCTAACACTGGACTTNTTAAATATGGATGTAATTAATAGATGACTGAATATTTT 

AGCTTCTCAAATATNTTTAANGTCCCTACAATGTTTGNATNTGCTTAAAATAAAATANA 

AAACCCCATATTACTTTCAGAAAANTATGCTAGCTAACAATAGGACANAAAAATTCTG 

TGTATGCAACAAAAAAAAATTCAACCTTNAATCTTCTTTTTTTTCCAATANAAAACAGGG 

CTACTCTGCCACAGGCTGGAGTCAGTGGCTGATACAGCTACTGCAGCTCACTCCGGG 

CTATGTGATTGCCTGCCTAAGCCTCNGAGTAGTAGGCTCAGGTGCCACTACATGCCAG 

TAATCTAAAATTTATAGAGACAGGGCTGCTGTGTGNCCAGGCTGGCTAACTCCGGGCT 

AAGCGTTCTTGCCTNGCTCTAAATGTGGGATACAGNATGTATCATNCATCAGCCAAAA 

AGTTAATTAANTTCCAGATNANTATTTGCATCAAAGCTCCAATNTAGCTTGAAGTAGA 

ACCTGCTCNTTGGCTAGANTATCCCGNNTGTTATGGATCATATTANGCNNTTGTGATGC 

CGAATGCNATCTATTCCGGGAGACANATTACTATNGGATGANAGCANATNGCCCNNAT 

GCTTNTTTGTAACGCTNNANNTAAGAACNTTCTNGACATCGTCATAGNTCGAAGTNNT 

NNGCGANTTGATACTAANTTCATGNTANGCCNATGACTNTNGTGATTNNTGANTGNCT 

GGGAGAACCTACNTNCCCCNTACNNATANNCTNCACCCCCTACTACTNTNNCNNTCNC 

TCTCTANTTCTACTCCACNTTATTATCCTCNNCCTTCNCATCNTCCCATCNTNATTCNAC 

GCCNCNANACTTANCNTTNATNCACTCTNNCT 



Sequence of BAGS using primer C3AS4 , which spans nucleotides 
737-756 of the cDNA. Exon sequence is underlined and 
represents nucleotides 677-736. 

TAATGTACATAGTGCTCATGACTGCAGATGATTCGTAGAAAATCCAGCCTCAAGGACA 

CCAGAACACTGGGATTCGGTAATGAGTAAAGCTTTGAAGACACCTTGTAAGCAATGCA 

TAAGTAAGAGAACACCAATTGAATCTATTATTTCTTTAATACTAATACCAGAATGGCA 

AATTAGAATTAAAGAGATAGTACTTGGTATCCAGTTTGGGTTTTGTGGCTTAAGTAGCA 

GTATCACCTTTTTCCAGAGTTACTGCTAAAATTAAAAATTTTAAACTATCAGGTTTACT 

GTATAAACATATTTGACTAACCTAAAAGCCACATTCTTGTATTTCCAATATAGCATCAA 

TATTTCTACTTCTCATAAAACAGGGAAAACGTATATCACCAAAAATAACTTCTTATTAC 

TTCCTTCTTAAAAGAAATTATCAATTCTTTTTATAGCACTTTGTGCTTACCTGTATTTAT 

AATTTGTCTGTTTTCTCAGCAACATCATAAGCTACTTGAGGAGACATACTATAAACTGA 

TTTAACAGCTTTAGTGTCCCTACAGCTTAGCTCAATGTTTGACAAATATAGGAGATCAA 

TGCTTAAAGGAATAAAGGCCAGGACAAGTTCTGGTAGCAAATAGTCCATAAAAGGTTT 

TGGGGGAAAAGGGTAAAAATGGATACATATCGGGGTNGCAAGNTTTTTCCATGTGGGG 

TGAGGTGCCCCATGCCTT 



Ref 3 . 1 




Ref 3.2 



Sequence of BAC9 using primer C3AS4 , which spans nucleotides 
737-756 of the cDNA. Exon sequence is underlined and 
represents nucleotides 677-731. 

TAACATAGTGCTCTGACTGCAGATGATTCGTAGAAAATCCAGCCTCAAGGACA CCAGA 

ACACTGGGATTCGGTAATGAGTAAAGCTTTGAAGACACCTTGTAAGCAATGCATAAGT 

AAGAGAACACCAATTGAATCTATTATTTCTTTAATACTAATACCAGAATGGCAAATTA 

GAATTAAAGAGATAGTACTTGGTATCCAGTTTGGGTTTTGTGGCTTAAGTAGCAGTATC 

ACCTTTTTCCAGAGTTACTGCTAAAATTAAAAATTTTAAACTATCAGGTTTACTGTATA 

AACATATTTGACTAACCTAAAAGCCACATTCTTGTATTTCCAATATAGCATCAATATTT 

CTACTTCTCATAAAACAGGGAAAACGTATNTCACCAAAAATAACTTCTTATTACTTCCT 

TCTTAAAAAGAAATTATCAATTCTTTTTATAGCACTTTGTGCTTACCCTGNATTTATAAT 

TTGNCTGNTTTTCTCAGCAAACATCATAAGCTACTTGAGGGAGACATACTATTAAACCT 

GATTACAGCTTTTANGTGTCCCTACAGCTTAACTCAATGTTTTGCAAAATNTNNGGAGA 

TCAATGGCTTTAAAGAATAAAAGANCAGGGACAAGTTNTGGGTNGCCATNAGNACAA 

TAAAGGTTTTNGGGGGAAAAGGGAAAAAATNGATTNCATNTCGNGGTTNGCAAGGTN 

TTTTCCATTGNGGGGNGGAGGGGCCCATGCCATAANTTTTAACCTTTCTTTTTTNGAAG 

AAATTAAACNNTTAAAGGGGTN 

Ref 4.1 

Sequence of BAC8 using primer HC3AS6, which spans nucleotides 
925-945 of the cDNA. Exon sequence is underlined and 
represents nucleotides 813-920. 

CCAGTCTGCAATATGCTGTGCGAAGCCGATATCAACTTTGCATCTTTGTCTTGNCATTC 

GAGAAATC AGACTTGTGGAAGTAGGAGACAGCTTACAGCGTGCACAAGCTCTCAGCA 

GAGCATATACGAATGAATCTTTTCCAGGGAGTTATTTATATACTACCTGAGCAAGCCA 

CTTTAGCTTTGGGCAGGAACTTNTGGATGTTATAAGTAATACTTATATGAATAATATGA 

AATTAATATTTACTTCTTTTACANTCTTCTCTTTTCCTTATCTTAGCCTTTATCCCCTTGT 

GGAAAAGACACTATCAATGCTAGATNCTCCCAAGNCAGAGAATTATGCAGGTTTGGTC 

AGAGAATCGACACAGACATGTTTACAGATTCTTCTTGAAATACATATTGTGCACGAGT 

TTTTTACANTATCTCAATTTAGATCTCAGACAGCATNTNGACTAGNGGGTCTAGGACAT 

AGATACATNTTTGNGAACTTCTATAGAANAACNTNTGCNTTAAAAAGGAGCTTGTTNG 

ANANGAATNNNCTGNGAAGGGCCCGATACGANAATTTGACTTCGGNGAAAATTNm 

GATTNNTACAAANTTCTAGGNGGCACCTTNAAAANGNNTGGGNACNTTGGNGGCGGA 

AAAAAAGCCCTTCNTTTAGNTNTCCCNGAAATGGAAAAGTNCCAANTTCCNAAAAAA 

ANGGGCTTTGTTNNCTTNCNANA 

Ref 4.2 

Sequence of BAC9 using primer HC3AS6, which spans nucleotides 
929-949 of the cDNA. Exon sequence is underlined and 
represents nucleotides 813-924. 

GACGCCAGCTCTGTACACAGTCTGCAATATGCTGTGCGGAAGGCCGATATCAACTATT 

GCATCTTTGTCTNGNCATCGAGAAATC AGACTCTGTGGAAGNAGGCAGACAAGACTAT 

ACAGCNTGCACANAGCATCTCAGCAGGCATATAAGAATGAANCTTTTCCAGGGAGTTA 

TTTATATACTACCTGAGCAAGNACTTCAACTTNGGCAGGAACTTGTGGATGNTTATAA 

GTATACTTATATGAATAANATNGAAATTAATATTTAATTCTTTTACTTCTTCTCTTTTCC 

TTATCTTAGCCTTTATCCCCTCGTGAAAAAGAGCACTAATCAATGCTATTNCTNCCAAG 

NCAGGAATTTATAGCAGGTTGGTCGAGAATCGACACGACATGTTTACAGANTCATCTT 

GAATACATNATTGTGCACGAGTNTTTTACTCTATCTCAAAATATAGATCTCAGATCGTC 

TATNGANTATGNGGTTCTAGGACATGATTACATTTTTNGGGAACTTCCATAGAATAAA 

CNTNTACCTNAAAANANGAGCCTGTTNGAAATNGAATCTACTNCTAAAGGGCNAGTNC 

CANATTTTACTTCCGCGANATNTCNGGATGTTACAAGTCTAGGGGGNCTTTAGNACGT 

TNGATNTTTGANCGGAAAAAAGCCCTTCTANNGGTCNCCTAATGGAAGCGCCAATTCC 

NAANAAGGNCTGTGTTNTTNGACATTTACCNGNNCCNTTTCTAATCAAACNTNCTCTTC 

TNNNANCCNCTUSrCNCNNNCCTATANNCCTATCNCTasnsrCTN^ 

NCTNTCTTCCNTTCTNCACTNTNlSnjmTCNCTNNNATNNNCTTCTCCNATCOT 

NNNTCANNCTCCCTACNNTNCNCNTNTTACCATCTNCNCCm 




Ref 5.1 

Sequence of BAC8 using primer C3S6, which spans nucleotides 
1128-1147 of the cDNA. Exon sequence is underlined and 
represents nucleotides 1182-1270. 

GTATGATCCGCCAGACCGCTGCCATGGCAATNGTAGGGACATCGGTCCCTCAACTAACAAG 
GCCTGGCAGTTTCCTNCTCACGTCAACG GTCAAAACAATCCTTCTACAGAATTTTTTTTTC 
TNGAAAGACAAATATTTACTAGGATATGCCCTTAAATATATGAGATGATTGTATCAGCTGA 
TGCAAAAGTGCTCAGTTTTATTTATGAAAATATTAAAGTTCCCAGAATATTAACTGTCTTC 
TCCCAAACAGTTTTAAAAAATGATTACCTCAAGGTTTATGGGAAAAAGCCCCGTATTCTGC 
ATTCAGAATTTGGAAAATTGCCTCATTATAGATAGCCATNTCTTTTTTTTNTTTTTTTTAT 
NCTTCAAGTCTTAGGGNACATGTGCACAACATGCAGGNTAGTTACATATGTATACATGTGC 
CATGTTGGTGTGCTGCACCCANNAACCCGCAATTTAACATTAGGTNTATCTCCAAATGCTA 
TCCNTTCACCCTTCCCCCATNCCACAACAAGGCCCCGGGCNTTGNGATGTTCCCCTTCCTG 
TGCCCACTGTGTNTCACATTNCCNCTTCCCNCCCTTANTNNNGTGCAGAACNTNGCCNGT^ 
GCCCTNTNTTTTTNNCCC 



Ref 5.2 

Sequence of BAC9 using primer C3S6, which spans nucleotides 
1128-1147 of the cDNA. Exon sequence is underlined and 
represents nucleotides 1152-1270. 

CCNCTGATGATTATGAAAGTGAGAGCGGAAGTATGATAAGCCAGACCGTTGCCATGGCAAT 
CGCAGGGACATCGGTCCCTCAACTAACAAGGCCTGGCAGTTTCCTCCTCACGTCAACG GTA 
AAAACAATCCTCCTACAGAATTTTTTTTTCTAGAAAGACAAATATTTACTAGGATATGCCC 
TTAAATATATGAGATGATTGTATCAGCTTGATGCAAAAAGTGCTCAGGTTTATTTATGAAA 
ATATTAAAGTTCCAGAATATTTAACTGTCTTCTCCCAACAGTTTTAAAAAATGATACCTCA 
GGTTTATGGGGAAAAAAGCCCCGTATTCTGTCATTCAGAAATTTGGAAAATTTGNCTCATT 
ATAGATAGTTCATTTTCTTTTTTTTTTTTTTTTTTTATACNTTTAAGTTTTTAAGGGGNAC 
CATGTTGCACCAAANATTGCAGGGGTTNGGTTACCATTATGGTTATTNCCATTGGTNCCCC 
CANTGTTTGGGGNGTTGGCTTTGCCACCCCCCAGNGTAAAACCNNCCGNTGCGAATTTTTA 
AAACAANTTTTGGGGGTTATTANTTNTTTCCCAAAAATNGGCNTTTTTNCCCTTTNCCCCC 
CTTTCNCCCNCCCNNTTCCCAACNNANCAAGGGGCCCCCCGGGTANTGGGGGAATAGNTTT 
CCCCCCCTTNCCCTNGNNGGGCCAATTGTGGGGNNNCTCCATTTGGNNTGCAAANTTCCCC 
CACCCNTNATTGTTGGTGGNGAAACCATTTCCCGGGGGGTTTGGGGGTTTTTTTTGGTCCC 
CNTTGCCCAANTAATTTTTGCNTTGAANAAAAAGAATGGGGTTTTCCAAAGCTTTTNGTCN 
CCATTTGNTCCCTTTANGGNNCCNTTNGTTCNCCTTNCCANAANGGGCCAATGTGAAACN^ 
CCTTTCATTTTTTTTTATTGGGGNTTNCCNTTATGGN 

Ref 6.1 

Sequence of BAC8 using primer C3S7, which spans nucleotides 
1514-1533 of the cDNA. Exon sequence is underlined and 
represents nucleotides 1536-1589. 

TTCTTGGGAGCATAGGTGCCAGGCAAGAAATGGTACGGCGAAGCCGAGGACAGCTCG GTAC 
GTACACAATAGCTTCTCCTCCTGGTGAGAATTTCTTCAATTTCCTTGAGTTGTATATTGTA 
ATGATCATTGTTGCTAGTCTTCAATGTCAATCCTATGCTTTTTAAAAAGTGTTTTAAGTGT 
AACTGTGAATTAACTTGAATAATCATTTCTCTGCAGTAATAAAAGTTAGAATTCTGATTTA 
GGTGAGTCAGCATACCGCCCCCCCCCCCGTTTTCTCTAGAAAGTCTTCTCTAGAAAACGTT 
CTCTAGAAAGTCCTATCTAGAAAACTTTCTCTAGAAAGTCCTTATGTGATTAATAGCATCC 
ATCCTCCCTTTTTAAATAGACTTTATTTTTGTAGAGCAGTTTTAAGTTCACAGCAAAAGTG 
AGCAAAGGTACAGAGATTTCCCATATACCCCTTAGTATGCGTAGCCTCCCCCATTATTAAC 
ATCCCCCATCAAGAGTAGTGCATTTGTTGTAACTGGTGAACCTACATTAACACATCATCAC 
CCAGAGTCCGCAGTTTACATTAGGGATCATTCATATAACATCTATTTTTACTTTTTTTTTT 
TTAGTTGAGACAAGATTCTCGCTCTGTCACCCAAGCTGGAGTGCAGTCCGNGTGGATTGTN 
GGCTTACTGNCN 




Ref 6.2 

Sequence of BAC9 using primer C3S7, which spans nucleotides 
1514-1533 of the cDNA. Exon sequence is underlined and 
represents nucleotides 1536-1589. 

GTGCCGGCAAGAAATGGTACGGCGAAGCCGAGGACAGCTCG GTACGTACACAATAGCTT 
CTCCTCCTGGTGAGAATTTCTTCAATTTCCTTGAGTTGTATATTGTAATGATCATTGTT 
GCTAGTCTTCAATGTCAATCCTATGCTTTTTAAAAAGTGTTTTAAGTGTAACTGNGAAT 
TAACTTGAATAATCATTTCTCTGCAGTAATAAAAGTTAGAATTCTGATTTAGGTGAGTC 
AGCATACCGCCCGCCCCCCCGTTTTCTCTAGAAAGTCTTCTCTAGAAAACGTTCTCTAG 
AAAGTCCTCTCTAGAAAACTTTCTCTAGAAAGTCCTTATGTGATTAATAGCATCCATCC 
TCCCTTTTTAAATAAGACTTTATTTTTGTAGAGCAGTTTTAAGTTCACAGCAAAAGTGA 
GCAAAGGGCAGAGATTTCCCATATACCCCTTAGTATGCGTAGCCTCCCCCATTATTAAC 
ATCCCCATCAGAGTAAGNGCATTTGTTGTAACTGGNGAACCTACAATTNACACATTNTN 
ACCCCACAGTCCCGCAGGTTTACATTTATGGGATCATTCCCCTANTAACACCTATTTTT 
TTACTTTTTTTTTTTTTTAGGTTGAGACAAGAATTTTCGGCTTCTTGTGTACCCCCAAA 
CTNGGTAGTAGNCNNACCGTCGNGNAATTTNTGGGGTTTCNTNGNGNNCANTTTGTGCN 
NTCNNCTTNNCNCCNAAAGAANTTTTTTTTCACCCTTTTTTTTCCCCCNAANANANNAN 
CCTTCCCCTTGGNGGANGCTGGGGACTTCCNCAGNGGNGG 



Sequence of BAC8 using primer C3S8, which spans nucleotides 
1843-1861 of the cDNA. Exon sequence is underlined and 
represents nucleotides 1862-1918. 

CAAAGTGCAGTTTATCTCAACACTGTTTTGCTCACAGAGAGCCTTGGTTTCAAAG GTAG 
GTTATTTTGTACCTGCAGTGTTGTCAGACTTTGTTTTTTTTATTAACATTGTCTAAGAT 
CATTTGACACATTCATTGGTTAATATATGTAGTAATATATTAATGAATATGTGTAGTTA 
AAATTTAAATAATAACCTAAGACCCTTAATTCTTCTTTGCCTCTCTACTGCTGCCTGCC 
TTTTAGAATTTTTCATTTATTCGAATCACCTTTAACCAGTTCTGGTTTGAAAAACAGTA 
ACTTGGATGTGGAGAAGGGCCTGAAATTAATAGCCAATCTTAAATATGGGGCTTCTCTT 
GTTTTCTCTTCACTTGGTTCTGTTTTTATAAAAAACTCAATTTATAAAGAATTCAATAT 
ATAAGCAATTCAACCCACTGAAATTATTTTATGATGAATGGAAAAGAAGGTATGTGTTT 
GTTCAACTGCTTTAAATGTTTACTTCTTATATTTGTTTTCCCTTAGAAATATGTATATT 
CTTAAATTTTGAAGGTAGCTATTTCATTTTAATCATCCTAGAGGATGGAATGCANAGAT 
GTTGGATGAAAATAACTTACGTATTATTTTGTAATAAATATAAGAATTCATATATGGTT 
GATTACCTAAGTGGTTTTATGCACATTCTGATAGAAAGCTTCACCAACAATCCCTTGNT 
NGATATATTATTT 

Ref 7.2 

Sequence of BAC9 using primer C3S8, which spans nucleotides 
1843-1861 of the cDNA. Exon sequence is underlined and 
represents nucleotides 1865-1918. 

AAGTCAGTTTATCTACAACACTGTTTTGCTACACAGAGAGCCTTGGTTTCAAAG GTAGG 
TTATTTTGTACCTGCAGTGTTGTCAGACTTTGTTTTTTTTATTAACATTGTCTAAGATC 
ATTTGACACATTCATTGGTTAATATATGTAGTAATATATTAATGAATATGTGTAGTTAA 
AATTTAAATAATAACCTAAGACCCTTAATTCTTCTTTGCCTCTCTACTGCTGCCTGCCT 
TTTAGAATTTTTCATTTATTCGAATCACCTTTAACCAGTTCTGGTTTGAAAAACAGTAA 
CTTGGATGTGGAGAAGGGCCTGAAATTAATAGCCAATCTTAAATATGGGGCTTCTCTTG 
NTTTCTCTTCACTTGGTTCTGTTTTTATAAAAAACTCAATTTATAAAGAATTCAATATA 
TAAGCCATTCACCCACTGAAATTATTTTATGATGAATGGAAAAGAAAGGTATGTGTTTG 
TTCACCTGCTTTAAAATGNGACNTCNTAATATTTTGGNTTTCCCCTTAAGAAAATATGT 
ATAATCCTTAAAANTTTNGAAAGGGANGCTANTTTCATTTTTTNAATCAATCCTAANAG 
GGATTGGGAAATGCNCAAGATTTTTTGATTGAAAAANAACCTTANCGNATTTAATTTTT 
NGGNAATAAAATAATTAGNAATTCCNTATTATGNTTNGAATTACCTAAAGTGGTTTTTA 
TTGCCCCATTTCNTTGATATGNAAAGCCTTTCACCAACCAAATTCCCCNTTGNNAGGAA 
TATTATTTTTTNANGGGCCTCNTNTTTGTGGGGNTGGAAGNAAAAACCTTTGTTCCAAA 
GGGTCCCCNC 



Ref 7.1 




Ref 8.1 

Sequence of BAC8 using primer C3S10, which spans 
nucleotides 2413-2432 of the cDNA. Exon sequence is 
underlined and represents nucleotides 2433-2524. 

AGTGCTAGTCCTCAGCAGCACTTGTTGCTGAATATTTGAGCATGCTGGAGGACCGGAAA 
TATCTTCCTGTGGGATGTGTAACATTTCAG GTAGGAATCTTCCAGATGTACATTAAATP 

AAGGTATATCTTTTTTTGGTTTTAGCTTTTCTCACTGGTGTTTAGATTTTTTTAGTTTA 
TAAGGAAAGCTTAAAGACTTAAGCCAATGCTTCACAAGGTGAATTAACATTTCACAGTG 
ATTGTCATTAATACATTTTTAAGGAGTACTTCTTGTTGATTCTCTTTCCACAGTTTCTT 
ACCTCTGAATTATCAGCACTATGCTTATTTATTCTCTTTGGCTTTACTGNCTTGNAATC 
CCGTTACATACTTTAACATCTATGGAAATGTATTACTGATAATCAGAATTCAGTAGAAA 
TTCTTAATTGGCTTTTACTTCACATAGCAGATATACCAACATTCTCTATTCCCTACATA 
AAATATTAAGATTATTTTATGACTAATACCCATGACTCACAGATGAGTTTGCCCTCTAG 
TAGGGTCATAATTCTGACCCACTAGTTGAATTCTCTGCTTACCAAGAGNCAGGTATGCT 
TGCTTTTTCTTCAAAACCTGTTAAATAGTAGGNTTGGGGATATTNTAAAAATTAGGTAA 
ATGGTATATCTTCTGGTGGAAANCAGAAN 

Ref 8.2 

Sequence of BAC9 using primer C3S10, which spans 

nucleotides 2413-2432 of the cDNA. Exon sequence is 

underlined and represents nucleotides 2445-2524. 

CAGCAGCACTTGTTGCTGAATATTTGAGCATGCTGGAGGACCGGAAATATCTTCCTGTG 

GGATGTGTAACATTTCAG GTAGGAATCTTCCAGATGTACATTAAATCAAGGTATATCTT 

TTTTTGGTTTTAGCTTTTCTCACTGGTGTTTAGATTTTTTTAGTTTATAAGGAAAGCTT 

AAAGACTTAAGCCAATGCTTCACAAGGTGAATTAACATTTCACAGTGATTGTCATTAAT 

ACATTTTTAAGGAGTACTTCTTGTTGATTCTCTTTCCACAGTTTCTTACCTCTGAATTA 

TCAGCACTATGCTTATTTATTCTCTTTGTCTTTACTGCCTTGTAATCCGTTACATACTT 

TAACATCTATGGAAATGTATTACTGATAATCAGAATTCAGTAGAAATTCTTAATTGGCT 

TNTTACTTCACATAGCAGATNTACCAACATTCTCTATTCCCTACATAAAATATTAGGAT 

TATTTTATGACTAATACCATGACTCACAGATTGAGTTTGCCCTCTANTAGGGTNCATAA 

TTTCTGACCCACTAGTTGAATTCTCTGCTTACCAAAAGTCANTTATGCCTTTGCTTTTT 

CTTCAAAACCCTGNTTAATTAGGNACGGCTTTGGAGATAATTTATAAAAAATTTCAAGC 

TNAAANTGGNTTATTATTCNTTCCNNGGTTGAAAAAACCCAGGAATTGGCACAAANNAA 

NAAAAAGNTTATTCCNGGTTTCTTTNCGGNAAAAAAACCAAAAAATCTTNGAAATTGTT 

TTTTACCAAAAANGACCTCCNCNGGGAAAAAGGGNGTAAATTTNTTCCNTAAAAACNN 




Ref 9.1 

Sequence of BAC9 using primer C3S11, which spans 

nucleotides 2680-2699 of the cDNA. Exon sequence is 

underlined and represents nucleotides 2712-2800. 

TTCCTATTCATGAAGCTAATCGGGATGCAAAGAAACTATCCACAATTCATGGTAAACTT 

CAAGAAGCATTCAGCAAAATTGTTCT^TCTVG GTAATGATTCCAATTTCTAGCTTCACTAT 

AAAGGGAAAAAACTGTCTGAAAGCATTAATGTTGTTTTGCACTGATGTCAAACTAGATC 

CCGTGAAATGACCATTTTAATCAGACTACAAATGAGCGGTCAAAATGATAGTTCATGGC 

CAAAGCAAAGCTCATTAACAATAAAAATGAATTCACCTAAAGTAAATGGTGATCATCAT 

AAACTTTCTGCATAGCTTTTTTTTTTTCATTTTTGAATTATTAATTAAGCAAGTTTTTA 

AAAATTGTGATTTTCTGTTTCACAAGGNAAGATCATAAGTTGNGGAATCTCATTTTTAA 

AAATTGATACCCTATTNCTTTTGCTGNGGAAAANTGGAAGTTTTTTAATATTTTCAAGG 

TTTTTTTTAAAATTNAAATGGATTGTGGAAAACCTTTTAAATNAATTTAAAACCTACCT 

AAAATANTTTTTTTAATGGNCCNNGCCANCTGGAACCNTTTTATTTTTTTCCCCTAGGA 

ATGGTTTTACCCAAATCCATTCCCTTTTGAATAATATTTTTCCCTNAATTNCCCAAAAA 

ANTTTTTNTTTTTTTGGGNGGAAAAAATANTTGGAAAATTAAAAAAATGGGGGTGGGGC 

CNTAAATGGGGATTATTTTTAAATTTCCTAAAAAAGGGANTTTTCCATTTACCTTTNAA 

TCCTTTTTGGGNGGNTTCNATTTATTGGGGAATCCTNCNCTTTTTTNTNCNCCTTAAAA 

AANTTAGGGCCTNCCAAAATTTTAAACCNTTTAATTTTTNAAAANGGAAAGGGNCCCCT 

TTCTTNGCCCGGTTGGTTT 

Ref 10.1 

Sequence of BAC8 using primer C3S12, which spans 

nucleotides 2909-2928 of the cDNA. Exon sequence is 

underlined and represents nucleotides 2929-2942. 

TACAGATTGGAGG TGAATGCTGTGGTGGTTCATAAAATGTCATCTTTAGTTTGTATTCT 

CTCTGATGATTAGACTTTCAGATCCAGATCTAATCATTTAGTAAGCCAGATCTTGCCAA 

ATAAACTACTCCGTTAGAGAATAAGGACTTTTAATAGTTACAATAATACTCTTTCAAAT 

CTTTTATGGCAGCAATAAAATAGTAATATTGTCTATTTTTTGAGACTATTTTCACACAT 

ATTTTAGAAACCCCTGTATCCTTCAGAATTACTGCGACTTAACGGAGAAATATATAGTA 

TAATCCCACATTTTGTTGAAAAAGACAAAGAATTAAGTAGTAGCTAATAATTGAACTAG 

AACCAGAACCCTAAGAAATTTCTGACCCAAGCATATTATCTCTTTGGCTTAACTGGTTC 

CAGGTGAGGTTTCTTTAGAACGTAAAAGCCTGAAATCACACCTTAAAAACACTTCCTTT 

AACCTTTATAATTTCTTAATTTTCACCATAAATGATTGCGTTTTATATTTACTGGGGCT 

AACTAGNATTTTCTGNTATAGGTATTCTTTCCAACCTTTCTCTATTTTTTGTTACTCAA 

AGTGTAGTGGATGGACCGGAAGCATTGGGGTTCACCTGGGAGAATGGTTGGNAATGCAG 

AACCCTTAGACCCCACCCCAGCCCCTGTGAAA 




Ref 10.2 

Sequence of BAC9 using primer C3S12, which spans 

nucleotides 2909-2 9278 of the cDNA. Exon sequence is 

underlined and represents nucleotides 2931-2942. 

CAGATAGTAGG TGAATGCTGTGGTGGTTCATAAAATGTCATCTTTAGTTTGTATTCTCT 

CTGATGATTAGACTTTC AGAT C CAGATCTAATCATTTAGTAAGCCAGAT CTTGCCAAAT 

AAACTACTCCGTTAGAGAATAAGGACTTTTAATAGTTACAATAATACTCTTTCAAATCT 

TTTATGGCAGCAATAAAATAGTAATATTGTCTATTTTTTGAGACTATTTTCACACATAT 

TTTAGAAACCCCTGTATCCTTCAGAATTACTGCGACTTAACGGAGAAATATATAGTATA 

ATCCCACATTTTGTTGAAAAAGACAAAGAATTAAGTAGTAGCTAATAATTTGAACTAGA 

ACCAGAACCCTAAGAAATTTCTGACCCAAGCATATTATCTCTTTGGCTTAACTGGTTCC 

AGGNGAGGTATCTTTAGAACGTNAAAGCCTGAAATCACACCTTAAAAACACTTNCTTTA 

ACCCTTTATAANTTNCTTAATTTTCACCCATAAAATNGATTGCNGTTTTATATTTTACC 

TNGGGNCTANACCTNAGGCAATTTTTCTGGGTCATAAGGGAAATTTCTTTTTCCCAAAC 

CCTTTTCTTCTTATTTTGTTGGGNCACCTCCCAAAAGGTGNTCNGTTGGGGNTTNGGGN 

CCCCGNGAAAGGCCATTTGGGGGNTTCACCCCCGGGGGANGATTTGGTTTTGGAAAATN 

GTCNTSTNAAAACCTTCCNNACCCCCNNCCCCCNGGCCCCCNTGNNNGGAAATCC^^ 

ATCTTGNCNATTTTTTTANCCAAANGANCNCCCCCNAGGGNGGGATTTTNGTTATTTCC 

CCAANAAGANGTAAGGTTNTTGGCCTTTNGGGGCNTTGGGTGTTTTTNTTTNN 

Ref 11.1 

Bac 9 sequenced with HC3AS3, which spans nucleotides 3377- 
3 3 94 of the cDNA, Exon sequence is underlined and 
represents nucleotides 3268-3290. 

CTGANGTGTNGCAANGCCACTCCTGTGTCTTTTCTGCATGTCCTCAATAGCAACTTCAA 

TCGGTGTTAAGATGATC TGAGTNANNGAGCATCTGTTANATCAGNGTACTGACTGAAAC 

TATTTAATGAACTTTATGTATAATCAACTGAAATTAGANAAAAAAAAGATCAATNGTAA 

ACTTCATGTAACAATAAAATTCCAAACTTGGATTCTAAATGAANNAAAAANATCAACCT 

TTAAAGAAAAGCTGGGGGTGAATAAGGGCTTAGAAAAGANGTANAAAATGANGACTCAA 

AATGGTAAAGGGTCTAATATGNATGGATAAGGATGGACATATCTTCGGACTCTGAGTGG 

TGTACATGGCTTGATGATTGCTCACTATGTGTGNCATTATGGCTACCTCTCTTTAGGCA 

TGCCTGTTAANTAGGAAGCTGAACTANCAAAGNCTCTTNGATGTATNANTCCTGCCGCT 

NAAGAAGGGGNCGCNTGANNCAAATGATTTGCNATGTNTCTGCTATNATNGNAAGNGNT 

CCTNGANTNNTTCNGANAAANCTCTCNANGAGNCTAGTTTACATNCGGTCAGNGCTTCT 

TGCACCTCCTGNGCATCTCCCGTANTTCA.CCCTCA.TTlSnsrACCNTNANTTTATAANNA 

NAGCCCACNTNNCCTATAGGCNATCNACGCNNTTCCCmTANTCANTNNNAGAC^ 

TTTNNCGCCCCCTCCTNNTCCTTCCTNNCTTCCNCCCNNCNNCCCTNTNTCTNTNCCCC 

CNCCNNTTCTTANCTTNCT 




Ref 12.1 

Sequence of BAC9 using primer C3S15, which spans 

nucleotides 3751-3770 of the cDNA. Exon sequence is 

underlined and represents nucleotides 3780-4119, and also 

represent the 3 terminus of the transcriptional unit. 

TGTGAAAAGATCTATTGGAAAACAACATGGAATGGAATTCTGGAAATTATTATTNATTG 

AAGAATGCAGTGGCCAAGAAAATATCAAATGTAGATTGTTAACGCTTGAGAATCATGGC 

TATGGTTTCTAATGTTCTGGTAACAAGCTGTTATCTTTTAAGACATTTTAATGACTCAA 

AGGTACACTATACATTTACCATTATTTATACCATAGCTAAGGTTAAAAATTTATTCACT 

TTAAGTTCGTATTTTTTAATTTATATTACCATTTATAGATTCATTTTGGAACCATTTTA 

AATGTAGTAATGCTTATTTTAAAGGTACTATTAAATATGTGAATGTTTACA CTAATTTT 

ACCGAGTGGGACTTCAAAATTTTTATTATTGACAATGGCTGAGAACAATTNAAGGGTTT 

GACTCNAGAACTANTTCCAAACCTAGCAGAATAAAAATCATAGATAGCCCCAAATTAAT 

GAGTTTGGGNAACTGTNTCAAAGTTTTTTTCCATTTACATACCCAAAAACAGGAAATTT 

TAGAATTTGCCNGAACCTTTACCTTAAGANAAAAACCCTTTTGTGNTNAAAAATNTANT 

NTTAAAATTCCCGGGGGGANTAATCTTAATNACCCCGGGTGGGGCCANNCNCCCCNTTA 

TAACTTTGGAATTTAAAAATTCNTTTTTTNTNCAACCCCAAACTGNANTNGGGTNNTTT 

TNAAGGAAAACCTTTCCACTNGGAAGTTNNCTTTTAGGGNCCNANCCTNCNANAAANNG 

GGGAANATTGGGAAGTCTTCCCCTTCNTTNGGGGGGNGNCCCAAAAAATTCTTAATAAA 

ANCCCCGGGGCTCCCATTNTTAGNATTTTTTTTTTTGGCCCCACACTGTGTTNATTAAA 

NCCCCNCNTGCTAAAAATTTTTNNNGAAAANACCTNAACCCTTCTNNA 
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HC2A 

KIAA 

rat 

HC4 

HCl 
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HC5 



ASGNLDKNARFSAIYRQDSNKLSNDDMLKLLADFRKPEKMAKLPVILGNLDITIDNVSSD 



FPNYVNSSYIPTKQFETCSKTPITFEVEEFVPCIPKHTQPYTIYTNHLYVYPKYLKYDSQ 



VLHHHQNPEF YDE I K 

KSFAKARKIAICIEFKDSDEEDSQPLKCIYGRPGGPVFTRSAFAAVLHHHQNPEFYDEIK 



lELPTQLHEKHHLLLTFFHVSCDNSSKGSTKKRDWETQVGYSWLPLLKDGRWTSEQHI 
I EL PTQLHE KHHLLLTFFHVS CDNS S KGS TKKRD WETQ VGYS WL PLLKDGR WTS EQH I 



PVSANLPSGYIiGYQELGMGRHYGPEIKWVDGGKPLLKISTHLVSTVYTQDQHLHNFFQYC 
PVSANLPSGYIiGYQELGMGRHYGPEIKWVDGGKPLLKISTHLVSTVYTQDQHLHNFFQYC 



GPGPARSTVS I SLI SNS ARV 



QKTESGAQALGNELVKYLKSLHAMEGHVMIAFLPTILNQLFRVLT-RATQEEVAVNVTRV 
QKTESGAQALGNELVKYLKSLHAMEGHVMIAFLPTILNQLFRVLT-RATQEEVAVNVTRV 



MEIQVLIRFLSVILMQLFWVLPNMIHEDDVPISCPMV 

MSFLPII LNQLFKVLV - QNE EDE I TTTVTRV 

NRSRSLSNSNPDISGTPTSPDDEVRSIIGSKGLDRSNSWVNTGGPKAAPWGSNPSPSAES 
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HC2A 

KIAA 
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HC4 

HCl 
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IIHWAQCHEEGLESHLRSYVKYAYKAEPYVASEYKTVHEELTKSMTTILKPSADFLTSN 
IIHWAQCHEEGLESHLRSYVKYAYKAEPYVASEYKTVHEELTKSMTTILKPSADFLTSN 

LFHIVSKCHEEGLDSYLSSFIKYSFRPGKPSAPQAPLIHETLATMMIALLKQSADFIiAIN 

LPDIVAKCHEEQLDHSVQSYIKFVFKTR ACKERPVHEDLAKNVTGLLK-SNDSPTVK 

TQAMDRS CNRMS SHTETSS FLQTLTGRLP TKKLFHEELALQWWCSG - - S VR E 



Cadherin 



KLLRYSWFFFDVLIKSMAQHLIENSKVKLL RNQR 
KLLKYSWFFFDVLIKSMAQHLIENSKVKLI RNQR 



KLLKYSWFFFEIIAKSMATYLLEENKIKLTHGQR 
HVLKjHSWFFFAIILKSMAQHLIDTNKIQLPRPQR 
SALQQAWFFFELNtVKSMVHHLYFNDKLEAEfeKSR 



Cleavage 
^PASYHHAAETWNMLMPHITQKFGD 
F-PASYHHAVETWNMLMPHITQKFRD 



?PKAYHHALHSLFIiAIT- IVESQYAE 
?PESYQNELDNLVMVLSDHVIWKYKD 
^PERFMDDIAALVSTIASDIVSRFQK 1.1/1.2/2.1/2.2 



HC2A NPEASKNANHSLAVFIKRCFTFMDRGFVFKQIN NYIS - - CFAPGDEIkTLFEYKFEFL 

KIAA NPEASKNANHSLAVFIKRCFTFMDRGFVFKQIN NYIS- -CFAPGDPKTLFEYKFEFL 

rat 

HC4 IPKESRNVNYSLASFLKCCLTLMDRGFVFNLIN DYIS- -GFSPKDPKVLAEYKFEFL 

HCl ALEETRRATHSVARFLkIrCFTFMDRGCVFKMVN NYIS- -MFSSGDLKTLCQYKFDFL 

HC3 DTEMVERIiNTSIiAFFLNDLLSVMDRGFVFSLIKSCYKQVSSKLYSLPNPSVlJVSLRLDFL 3.1/3.2 

HC5 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



RWCNHEHYI PLNLPM- 
RWCNHEHYI PLNLPM- 



PFGKGRIQR- 
PFGKGRIQR- 



YQDLQL DYSLTDEF 

YQDLQL D YS LTDE F 



QTICNHEHYI PLNLPM AFAKPKLQR VQDSNL EYSLSDEY 

QEVCQHEHFIPLCLPIRSANIPDPLTPSES TQELHASDMPEYSVTNEF 

RIICSHEHYVTLNLPCSLLTPPASPSPSVSSA'l|SQSSGFSTNVQDQKIANMFELS--VPF 4.1/4.2 
MNADTAPTSPCPSIS SQNSSSCSSFQDQKIASMFDRTSRVPA 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



Cadherin 
EC motif 



CRNHFLVGL 
CRNHFLVGL 


LLRE 
LLRE 


^^GTALQEFRE VRLIAISVLKNLLIKHSFDDRYA^RSHQARIAT 

/GTALQEFRE VRLIAISVLKNLLIKHSFDDRYASRSHQARIAT 


CKHHFLVGL 
CRKHFLnjGI 
RQQHYLAGL 
SSTS-SPGL 


LLRE 
LLRE 
^LTE 
LFTE 


rs I ALQDNYE I RYTAI S VI KNLLI KHAFDTRYQHKNQQAKI AQ 

/GFALQEDQD VRHLALAVLKNLMAKHS FDDRYRE PRKQAQ IAS 

liAVILDPDAEGLFGLHKKVINMVHNLLSSHDSDPRYSDPQIKARVAM 
lAAALDAEGEGISEVQRKAVSAIHSLLSSHDLDPRCVKPEVKVKIAA 



HC2A LYLPLFGLLIENVQRINVRDVSPFPVNAG-MTVKDESLALPAVNPLVTPQKGSTLDNSLH 

KIAA LYLPLFGLLIENVQRINVRDVSPFPVNAG-MTVKDESLALPAVNPLVTPQKGSTLDNSLH 

rat 

HC4 LYLPFVGLLLENIQRLAGRDTLYSCAAMPNSASRDEFPCG FTSP- - AN- -RGSLS 

HCl LYMPLYGMLLDNMPRIYLKDLYPFTVNTSNdGSRDDLSTNGGFQSQTAIKHANSVDTSFS 

HC3 LYLPLIGIIMETVPQLYDFTETHNQRGRPICIATDDYESE SG SMIS 

HC5 LYLPLVGIILDALPQLCDFTVADTRRYR TSGSDEEQE GA GAIT 
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HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



kdllgaisgi|aspyttstpninsvrnadsrgslistdsgnslpernseksnsldk|hqqss 
kdllgaisgiaspyttstpninsvrnadsrgslistdsgnslpernseksnsldkhqqss 



TDKDTAYGS FQNG - 
KDVLNSI^AFSS-- 
QTVAMAIAGTSVPQ 
QNVALAIAGNNFN- ■ 



Ref s 



HGIKREDSRGSLIP-EGATGFPDQGNTGEN TRQS 

lAISTVNHADSRASLASLDSNPSTNEKSSEKTDNCEKIPRPL 

LTRPGS FLLTSrrSGRQHT 

LKTSG-IVLSstiPYKQYN 



5.1/5.2 



HC2A TLGNSWRCDKLDQSEIKSLLMCFLYILKSMSDDALFTYWN-KASTSELMDFFTISEVCL 

KIAA TLGNSWRCDKLDQSEIKSLLMCFLYILKSMSDDALFTYWN-KASTSELMDFFTISEVCL 

rat 

HC4 STRSSVSQYNRLDQYEIRSLLMCYLYIVKMISEDTLLTYWN-KVSPQELINILILLEVCL 

HCl ALIGSTLRFDRLDQAETRSLLMCFLHIMKTISYETLIAYWQ-RAPSPEVSDFFSIl|3VCL 

HC3 TFSAESSRSLLICLLWVLKN-ADETVLQKWFTDLSVLQLNRLLDLLYLCV 

HC5 MLNADTTRNLMICFLWIMKN-ADQSLIRKWIADLPSTQLNRILDLLFICV 



HC2A HQFIqYMGKRYIARNQEGLG- -PIVHDRKS QTLPVSRNRTGMM 

KIAA HQFQYMGKRYIAR TGMM 

rat 

HC4 FHFRYMGKRNIARVHDAWLSKHFGIDRKS QTMPALRNRSGVM 

HCl QNFRYLGKRNI IRKIAAAF- - KFVQSTQNNGTLKGSNPSCQTSGLLAQWMHSTSRHEGHK 

HC3 SCFEYKGKKVFERMNSLTFK- -KSKDMRAK LEEAILGSIGARQEMV 

HC5 LCFEYKGKQSSDKVSTQVLQ--KSRDVKAR LEEALLRGEGARGEMM 



HC2 A HARLQQL GSLDNS LTFNHS YGHSDADVLHQSLLEANI ATEVC 

KIAA HARLQQL GS LDNS LT FNHS YGHS DADVLHQS LLE AN I ATEVC 

rat 

HC4 QARLQHL SSLESS FTLNHS STTT EAD I FHQALLEGNTATE VS 

HCl QHRSQTLPIIRGK NALSNPKL LQMLDNTMllSNSNEIDIVHHVDTEANIATEGC 

HC3 rrsrgqiJerspsgsafgsqenlrwrkdmthwrqntekldksraeiehealidgnlatean 6.1/6.2 

HC5 RRRAPGNDRFP GLNENLRWKKEQTHWRQANEKLDKTKAELDQEALISGNLATEAH 



HC2A LTALDTLSLFTLAFK|NQLLADHGHNPLMKKVFDVYLCFLQKHQSETALKNVFTALRSLIY 

KIAA LTALDTLSLFTLAFKNQLLADHGHNPLMKKVFDVYLCFLQKHQSETALKNVFTALRSLIY 

rat KLSRGHSPLMKKVFDVYLCFLQKHQSEMALKNVFTALRSLI Y 

HC4 LTVLDTISFFTQCFKTHFLNNDGHNPLMKKVFDIHLAFLKNGQSEVSLKHVFASLRAFIS 

HCl LTILDLVSLFTQTHQRQLQQCDCQNSLMKRGFDTYMLFFQVNQSATALKHVFASLRLFVC 

HC3 LIILDTLEIWQTVS- -VTES- -KESILGGVLKVLLHSMACNQSAVYLQHCFATQRALVS 

HC5 LIILDMQENIIQASS- -ALDC- -KDSLLGGVLRVLVNSLNCDQSTTYLTHCFAfTLRALIA 



HC2A KFPSTFYEGRADMCAALCYEILKCCNSKLSSIRTEASQLLYFLMRNNFDYTGKKSFVRTH 

KIAA KFPSTFYEGRADMCAALCYEILKCCNSKLSSIRTEASQLLYFLMRNNFDYTGKKSFVRTH 

rat KFPSTFYEGRADMCASLCYEVLKCCNSKLSS IRTEASQLLYFLMRNNFDYTGKKSFVRTH 

HC4 KFPSAFFKGRVNMCAAFCYEVLKCCTSKI SSTRNEASALLYLLMRNNFE YTKRKTFLRTH 

HCl KFPSAFFQGPADLCGSFCYEVLKCCNHRSRSTQTEASALLYLFMRKNFEFNKQKSIVRSH 

HC3 k|fpellfeeeteqcadlclrllrhcsssigtirshpsaslyllmrqnfeign- -NFARVK 7.1/7.2 

HC5 KFGDLLFEEEVEQCFDLCHQVLHHCSSSMDVTRSQACATLYLLMRFSFGATS - -NFARVK 

HC2A lqviisvsqliadwgigetrfqqslsiinncansdrlikhtsfssdvkdltkrirtvlm 

KIAA lqviisvsqliadwgiggtrfqqslsiinncansdrlikhtsfssdvkdltkrirtvlm 

rat lqviislsqliadwgiggtrfqqslsiinncansdrlikhtsfssdvkdltkrirtvlm 

HC4 lqiiiavsqliadvalsggsrfqeslfiinnfansdrpmlarafpaevkdltkrirtvlm 

HCl LCtlJ^VSQLIAD-AGIGGSRFQHSIAITNNFANGDKQMKl^SNFPAEVKDLTKRIRTVLM 

HC3 mqvpmslsslvgtsqnfneeflrrslktiltyaeedlelrettfpdqvqdlvfnlhmils 

HC5 mqvtmslaslvgrapdfneehlrrslrtilayseedtamqmtpfptqveellcnlnsily 
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ATAQMKEHENDPEMLVDLQYSLAKSYASTPELRKTWLDSMARIHVKNGE 
ATAQMKEHENDPEMLVDLQYSLAKSYASTPELRKTWLDSMARIHVKNGE 
ATAQMKEHENDPEMLVDLQYSLAKSYASTPELRKTWLDSMARIHVKNGE 
ATAQMKEHEKDPEMLIDLQYSLAKSYASTPELRKTWLDSMAKIHVKNGE 
ATAQMKEHEKDPEMLVDLQYSLANSYASTPELRRTWLESMAKIHARNGE 



DTVKMKEHQEDPEMLIDLMYRIAKGYQTSPDLRLTWLQNMAGKHSERS^HAEAAQCLVHS 
DTVKMREFQEDPEMLMDLMYRIAKSYQASPDLRLTWLQNMAEKHTKKKC YTEAAMCLVHA 



lseaamcyvhv 
lseaamcyvhv 
lseaamcyvhv 
fseaamcyvhv 
lseIaamcyihi 



TALVAEYI TRKGV 



AALVAEYLSMLED 
AALVAEYISMLED 



TRKEA- • 
TRKEAD- 



TALVAEYI 
TALVAEYI 
AALVAEFIHRKKL 
AALIAEYI KRKGYWKVEKK 



SH3 



FRQGCTAFRVITPN 

■VQWEPPLLPHSHSACLRRjSRGGVFRQGCTAFRVITPN 
LALQREPPVFPYSHTSCQRKPRGGMFRQGCTAFRVITPN 

FPNGCSAFKKITPN 



arASLLSEDTHPCDSNSLLTTPlSGGSMFSMGWPAFLSITPN 

- RKYLP VGCVTFdN I S SN 
-HSYLPVGSVSFQNISSN 



IT AM 



IPI 
IPI 



I DE EAS MMEDVCi^QD VHFNEDVLMELLEQCADGLWKAER ^ELIADIYKLI 

I DEEASMMEDVGMQD VHFNEDVLMELLEQCADGLWKAER^ELIADIYKLI 

IDEEASMMEDVGMQD VHFNEDVLMELLEQCADGLWKAER LRAGL LTSINSSSP 

IDEEGAMKEDAGMMD VH YS EEVLLELLEQCVNGLWKAERjif E 1 1 

IKEEGAAKEDSGMHD TPYNEjWILVEQLYMCGEFLWKSER ^ELI 



seiskligpi 
I adv nkpi iav 

vleesavsddwspdeegicsgkyfte'sglvglleqaaasfsmagmyeavnev|ykviJipi 
vleeswsedtlspdedgvcagqyftesglvglleqaaelfstgglyetvnevyklvipi 



8.1/8.2 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



ITAM 

YEKlfeo 

YEKRRDFERLAHL 
SMKSGGTLETTHL 



ITAM 



ITAM 



ITAM 



YDTIHRA 



YSKV TEVMHSGRRLLGT iTFRV \FFGQAAQYQFTDSETDVE 



YDTIHRPYSKVTEVITR A AGSWDLLPGGLFGQ 

YENRREFENLTQv|yRTi1hGa|yTKI LEVMHTKKRLLG TPFRVAFYGQ 

LKV AEWNSEKRLFG R 

HEANRDAKKLSTIHGKLQEAFSKIVHdSTGWERMFG T 

LEAHREFRKLTLTHSKLQRAFDSIVNKDH--KRMFG T 



iTYRVAFYGQ 
iTFRVSFYG- 
fFRVSFFG- 



9.1 



ITAM 



ITAM 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



- FFEDEDGKE YI YKE PKLTPLSEISQRLLKL fSDK ? 
GFFEDEDGKEYIYKEPKLTPLSEISQRLLKLSirSDK? 
GFFEDEDGKEYIYKEPKLTPLSEISQRLLKLSfSDK? 
SFFEEEDGKE YI YKE PKLTGLSEISLRLVKL fGEK ? 
GFFEEEEGKE YIYKE PKLTGLSEISQRLLKL !fADKp 
TKFGDLDEQE FVYKE PAITKLAEISHRIJegI? fGER 
S KFGDLDEQE FVYKE P AI TKL PE I SHRLEAF fGQC 



'GSENVKMIQDSGllVNPKDLDSKpA 
'GSENVKMIQDSGKVNPKDLDSK f A 
•GSENVKMIQDSGKVNPKDLDSK 7A 
'GTENVKI IQDSDKVNAKELDPK I A 
•GADNVKI IQDSNKVNPKDLDPKgA 
•GED WE VI KDSNPVDKCKLDPNKA 10 . l/lO . 2 
'G AE F VE V I KD S T P VD Kfr KL D PNKA 



ITAM 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



y3qvthvipffdekelqerktefershnirrfmfempftqtgkrqggveeqckrf|tilta 
yiqvthvipffdekelqerktefershnirrfmfempftqtgkrqggveeqckrrtilta 
yiqvthvtpffdekelqerkteferchnirrfmfempftqtgkrqggveeqckrrtilta 
hiqvtyvkpyfddkelterktefernhnisrfvfeapytlsgkkqgcieeqckrrtiltt 



yiqutyvtpffeekeiedrktdfemhhninrfvfetpftlsgkkhggvaeqckrrtiltfr 
yiqityvepyfdtyemkdrityfdknynlrrfmyctpftldgrahgelheqfkrktiltt 
yiqitfvepyfdeyemkdrvtyfeknfnlrrfmyttpftlegrprgelheqyrrntvltt 
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HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



HC2A 
KIAA 
rat 

HC4 
HCl 
HC3 
HC5 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



PVMYQHHTDLNP lEVAIDEMSKKVAELRQLCSSAEVDMIKLQLKLQG SV 



Coiled-Coil 1 



Ref S 



IHCFPYVKKRI 
IHCFPYVKKRI 

IHCFPYVKKRI PVMYQHHTDLNP 
SNSFPYVKKRIPINCEQQINLKP 

SHLFPYVKKRIQVISQSSTELNPIEVAIDEMSRKVSELNQLCTMEEVDMISLQLKLQdSV 
SHAFPYIKTRVNVTHKE|EI ILTP lEVAIEDMQKKTQELAFATHQDPADPKMLQMVLQG SV 11 . 1 
MHAFPYIKTRISVIQKEEFVLTPIEVAIEDMKKKTLQIiAVAINQEPPDAKMLQMVLQG SV 



PVMYQHHTDLNP I EVA I DEMS KKVAELRQLCS S AEVDM I KLQLKLQG S V 

lEVAIDEMSKKVAELHQLCSSAEVDMIKLQLKLQG SV 
IDGATDEIKDKTAELQKLCSSTDVDMIQLQLKLQGWV 



S VQIvNAGPLAYARAFLDDTNTKRYPDNKVKLLKEVFRQFVEACGQA IjAVNERLI kedqle 
S VQVNAGPLAYARAFLDDTNTKRYPDNKVKLLKEVFRQFVEACGQA [iAVNERLI KEDQLE 
S VQVNAGPLAYARAFLDDTNTKRYPDNKVKLLKEVFRQFVEACGQA jAVNERLI kedqle 
SVQVNAGPLAYARAFLNDSQASKYPPKKVSELKDMFRKFIQACSIA :jELNERLIKEDQVE 
S VKVNAGPMAYARAFLEETNAKKYPDNQVKLLKE I FRQFADACGQA jDVNERL I KEDQLE 
GTTVNQGPLEVAQVFLSEIPSDPKLFRHHNKLRLCFKDFTKRCEDAliRKNKSLIGPVQKE 
GATVNQGPLEVAQVFLAEIPADPKLYRHHNKLRLCFKEFIMRCGEA^EKNKRLITADQRE 



Coled-Coi l 2 



Coiled-Coil 2 



YQEEMKANYREMAKELSEIMHE^ICPLEEKTS-VLPNSLHIFNAISGTPTSTMVHGMTSS 

YQEEMKANYREMAKELSEIMHE3LG 

YQEEMKANYREIRKELSDIIVPRICPGEDKRATKFPAHLQRHQRDTNKHSGSRVDQFILS 
YHEGLKSNFRDMVKELS D 1 1 HE 3 1 LQEDTMHS PWMSNTLHVFCA I SGTS SDRG YGS PRYA 

yqeelrshykdmlselstvi4ne3itgrddlsk RGVDQTCTRVISKATPALPTVSISS 

YQRELG KLSS P 

YQQELKKNYNKLKENLRPMIER KI PELYKPI frvesqkrdsfhrssfrkcetqlsqgs - - 



PBM 



ssw 



cvtlphephvgtcfvmcklrttfranhwfcqaqeeamgngrekepwtvifnsrfyrswgk 

EV ^ 

SAEV 



HC2A 

KIAA 

rat VHIFF 

HC4 

HCl 

HC3 

HC5 
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genomic DNA BAG 9 DNA 



5 



-21 

GTCGCCGTCGCCGCAGCAGCC -1 
1/1 

ATG GCC GAG CGC CGC GCC TTC GCC CAG AAG 
Met ala glu arg arg ala phe ala gin lys 
61/21 

AAG CAG ATC TCC GGA CAA TAT AGT GGT TCT 
lys gin ile ser gly gin tyr ser gly ser 
121/41 

GGC AAT ATA TCC CAT CAC ACC ACA GTG CCC 
gly asn ile ser his his thr thr val pro 
181/61 

GAA GAT TAG CTC ATT ACT CAT CCT TTG GCT 
glu asp tyr leu ile thr his pro leu ala 
241/81 

GAA TTT CCT CCA GAT GAT ATT GAA GTT GTT 
glu phe pro pro asp asp ile glu val val 
301/101 

TCA GCT GTA CCT GAA GAA AGT GAA ATG GAT 
ser ala val pro glu glu ser glu met asp 
361/121 

ACA GAA GAC TGG GCA ATT GTC ATC AGA AAA 
thr glu asp trp ala ile val ile arg lys 
421/141 

AAT ACA TTA GAT AAA CAG AAA GAA AGG CAA 
asn thr leu asp lys gin lys glu arg gin 
481/161 

GAT GAA GCT CCA GAT GGC AAC AGC TAC CAG 
asp glu ala pro asp gly asn ser tyr gin 
541/181 

ATG TCA ATA GAT GAT ACC CCA AGG GGT AGC 
met ser ile asp asp thr pro arg gly ser 
601/201 

TCA CTT CCT GAT GCT TTG CTT CCC AAT TTA 
ser leu pro asp ala leu leu pro asn leu 
661/221 

CGT CAG AAT GAT GAC CAA AGG AAA TCA AAC 
arg gin asn asp asp gin arg lys ser asn 
721/241 

TCA CCA GAT GAG GAA GAA CCA ATA GAA CGG 
ser pro asp glu glu glu pro ile glu arg 
781/261 

TTT GGT CAA AGA CTT CTT GTA AAA TGC TTA 
phe gly gin arg leu leu val lys cys leu 
841/281 

ATT TTT GCA AGT TTG GCT TTA TAT GAT GTC 
ile phe ala ser leu ala leu tyr asp val 
901/301 

TAT TTT GAC CTT AAT TCT GAG CAG ATG AAA 
tyr phe asp leu asn ser glu gin met lys 
961/321 

GCC ATT ACT ACC CTG GCA AGA TCA GCA ATT 
ala ile thr thr leu ala arg ser ala ile 



31/11 

ATC AGC AGA ACG GTG GCA GCC GAA GTT AGG 
ile ser arg thr val ala ala glu val arg 
91/31 

CCC CAA CTG CTC AAA AAC CTT AAT ATT GTT 
pro gin leu leu lys asn leu asn ile val 
151/51 

CTT ACC GAA GCA GTA GAT CCA GTG GAT TTG 
leu thr glu ala val asp pro val asp leu 
211/71 

GTG GAT TCT GGG CCT TTA CGG GAT TTG ATT 
val asp ser gly pro leu arg asp leu ile 
271/91 

TAT AGT CCT CGG GAC TGC AGA ACT CTT GTT 
tyr ser pro arg asp cys arg thr leu val 
331/111 

CCA CAT GTT AGA GAC TGT ATA AGA AGT TAT 
pro his val arg asp cys ile arg ser tyr 
391/131 

TAT CAT AAA TTG GGA ACA GGA TTT AAT CCC 
tyr his lys leu gly thr gly phe asn pro 
451/151 

AAA GGT TTG CCA AAA CAA GTT TTT GAA TCT 
lys gly leu pro lys gin val phe glu ser 
511/171 

GAT GAT CAA GAT GAC CTT AAA AGA CGT TCA 
asp asp gin asp asp leu lys arg arg ser 
571/191 

TGG GCC TGT AGT ATC TTT GAC TTG AAA AAT 
trp ala cys ser ile phe asp leu lys asn 
631/211 

CTT GAT CGA ACT CCA AAT GAA GAA ATA GAC 
leu asp arg thr pro asn glu glu ile asp 
691/231 

CGT CAC AAA GAA CTT TTT GCT TTG CAT CCA 
arg his lys glu leu phe ala leu his pro 
751/251 

CTT AGT GTT CCT GAT ATA CCC AAA GAA CAT 
leu ser val pro asp ile pro lys glu his 
811/271 

TCA CTC AAG TTT GAA ATT GAA ATT GAA CCC 
ser leu lys phe glu ile glu ile glu pro 
871/291 

AAG GAA AAG AAA AAG ATT TCA GAA AAC TTT 
lys glu lys lys lys ile ser glu asn phe 
931/311 

GGG TTG TTA CGT CCA CAT GTA CCA CCT GCT 
gly leu leu arg pro his val pro pro ala 
991/331 

TTT TCT ATC ACT TAT CCT TCC CAA GAT GTT 
phe ser ile thr tyr pro ser gin asp val 



FIG. 6A (1 of 6) 



1021/341 

TTT CTT GTA ATA AAG 
phe leu val ile lys 
1081/361 

CCA TAT ATG ATT TTC 
pro tyr met ile phe 
1141/381 

AAG AGT CAA GCA GAT 
lys ser gin ala asp 
1201/401 

ACT GCA ATC CAT TTA 
thr ala ile his leu 
1261/421 

GAA GTA GAA ATC AGT 
glu val glu ile ser 
1321/441 

ATT GTT GGC AGA CGA 
ile val gly arg arg 
1381/461 

AGC TTT CGA CCA GCT 
ser phe arg pro ala 
1441/481 

AGT GAT GAA GAT CTC 
ser asp glu asp leu 
1501/501 

CGA CTA AGA CCT ATT 
arg leu arg pro ile 
1561/521 

CAT TAT TGC CTA ACT 
his tyr cys leu thr 
1621/541 

CCT ACC AGA GAA ATC 
pro thr arg glu ile 
1681/561 

AGA AAT CTT CTC TAC 
arg asn leu leu tyr 
1741/581 

AGA AAT ATA ACA GTG 
arg asn ile thr val 
1801/601 

GTA ATC TTT GGT AAA 
val ile phe gly lys 
1861/621 

TAT CAT AAC AGG TCT 
tyr his asn arg ser 
1921/641 

ACT GAC CAT CAT CAC 
thr asp his his his 
1981/661 

ACT CCT CTT GAA ACA 
thr pro leu glu thr 
2041/681 

AAG ACT GGC CAG TTT 
lys thr gly gin phe 
2101/701 

CTG TCT CCT GAG GTT 
leu ser pro glu val 



CTA GAA AAA GTC CTA 
leu glu lys val leu 

AAA GAA GCA GAT GCC 
lys glu ala asp ala 

CAG TTT TGC CAA AGA 
gin phe cys gin arg 

ATG AAT ATT GTT AGC 
met asn ile val ser 

ACT GGA GAA CGA AAA 
thr gly glu arg lys 

TCA CTT GAA AGG ACA 
ser leu glu arg thr 

ACT CTC ACA GTG ACA 
thr leu thr val thr 
* 

TAC AAA TTC CTT GCT 
tyr lys phe leu ala 

ACA GCT CAG CTC AAG 
thr ala gin leu lys 

CCG GAG CTG CTT CAA 
pro glu leu leu gin 

TTA GAG TTT CCC GCA 
leu glu phe pro ala 

ATA TAC CCT CAG AGT 
ile tyr pro gin ser 

AAA GTC CAG TTT ATG 
lys val gin phe met 

TCT AGC TGT TCA GAA 
ser ser cys ser glu 

CCT GAT TTT CAT GAA 
pro asp phe his glu 

TTG CTT TTT ACT TTT 
leu leu phe thr phe 

CCA GTT GGA TAT ACA 
pro val gly tyr thr 

TGC TTG CCA GTC TCA 
cys leu pro val ser 

CCT CTA CCT GGC ATG 
pro leu pro gly met 



1051/351 

CAG CAA GGA GAC ATT 
gin gin gly asp ile 
1111/371 

ACC AAG AAT AAA GAA 
thr lys asn lys glu 
1171/391 

CTT GGG AAA TAT CGC 
leu gly lys tyr arg 
1231/411 

AGT GCT GGG AGT TTG 
ser ala gly ser leu 
1291/431 

GGG TCT TGG TCA GAG 
gly ser trp ser glu 
1351/451 

ACA AGT GGA GAT GAT 
thr ser gly asp asp 
1411/471 

AAT TTT TTT AAG CAG 
asn phe phe lys gin 
1471/491 

GAT ATG AGA AGG CCA 
asp met arg arg pro 
1531/511 

ATA GAC ATT TCT CCC 
ile asp ile ser pro 
1591/531 

GTG AAG CTT TAC CCT 
val lys leu tyr pro 
1651/551 

AGG GAT GTT TAT GTT 
arg asp val tyr val 
1711/571 

CTT AAT TTT GCC AAT 
leu asn phe ala asn 
1771/591 

TAT GGA GAG GAT CCA 
tyr gly glu asp pro 
1831/611 

TTT TCA AAG GAA GCC 
phe ser lys glu ala 
1891/631 

GAA ATC AAG GTT AAG 
glu ile lys val lys 
1951/651 

TAT CAT GTT AGT TGT 
tyr his val ser cys 
2011/671 

TGG ATA CCA ATG CTT 
trp ile pro met leu 
2071/691 

TTG GAA AAA CCA CCA 
leu glu lys pro pro 
2131/711 

AAA TGG GTA GAT AAT 
lys trp val asp asn 



GGA GAG TGT GCA GAA 
gly glu cys ala glu 

AAA CTG GAG AAA CTG 
lys leu glu lys leu 

ATG CCT TTT GCT TGG 
met pro phe ala trp 

GAA AGA GAT TCT ACA 
glu arg asp ser thr 

AGG AGG AAT TCT AGT 
arg arg asn ser ser 

GCT TGT AAC TTG ACG 
ala cys asn leu thr 

GAA GGA GAC CGC TTA 
glu gly asp arg leu 

TCT TCT GTC TTA CGG 
ser ser val leu arg 

GCA CCT GAA AAT CCC 
ala pro glu asn pro 

GAC AGT AGA GTT AGA 
asp ser arg val arg 

CCA AAC ACT ACT TAC 
pro asn thr thr tyr 

CGT CAA GGT TCT GCT 
gly ser ala 

AGC AAT GCC ATG CCG 
ser asn ala met pro 

TAT ACA GCC GTA GTA 
tyr thr ala val val 

CTT CCT GCT ACT TTA 
leu pro ala thr leu 

CAA CAA AAA CAA AAT 
gin gin lys gin asn 

CAG AAT GGA CGG TTG 
gin asn gly arg leu 

CAG GCT TAT TCT GTA 
gin ala tyr ser val 

CAC AAA GGT GTT TTT 
his lys gly val phe 
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2161/721 

AAT GTT GAA GTT GTT 
asn val glu val val 
2221/741 

TTT GCT CTG GTC AAT 
phe ala leu val asn 
2281/761 

ATC ATG GAA AAT AAC 
ile met glu asn asn 
2341/781 

GAG CTG GAA CCA GTG 
gin leu glu pro val 
2401/801 

ATT AGA CCT CCT GTC 
ile arg pro pro val 
2461/821 

ATG GCA TCA ATT ATA 
met ala ser ile ile 
2521/841 

AGA AAC AGC CTT CTT 
arg asn ser leu leu 
2581/861 

AAT TCA TCA TCA CCA 
asn ser ser ser pro 
2641/881 

AGA TCT GCG GTG AGA 
arg ser ala val arg 
2701/901 

AAT CCA GAT ATA TCT 
asn pro asp ile ser 
2761/921 

AGT AAG GGT TTA GAT 
ser lys gly leu asp 
2821/941 

TGG GGA TCC AAC CCC 
trp gly ser asn pro 
2881/961 

CGT ATG TCT TCG CAC 
arg met ser ser his 
2941/981 

ACT AAA AAG CTT TTT 
thr lys lys leu phe 
3001/1001 

CGG GAA TCA GCT TTG 
arg glu ser ala leu 
3061/1021 

CAC CAT TTA TAC TTT 
his his leu tyr phe 
3121/1041 

TTC ATG GAT GAC ATT 
phe met asp asp ile 
3181/1061 

CAG AAG GAC ACA GAA 
gin lys asp thr glu 
3241/1081 

CTG TTG TCT GTT ATG 
leu leu ser val met 



GCT GTT TCG TCT ATC 
ala val ser ser ile 

GCT CTG GAT GAA CAC 
ala leu asp glu his 

TTA GAA AAT GAA TTG 
leu glu asn glu leu 

GTC CGA TTT CTT CAT 
val arg phe leu his 

ATT GCT GGC CAA ATA 
ile ala gly gin ile 

AAT CGA CTT CAC AAA 
asn arg leu his lys 

GCA TCA TAT ATT CAT 
ala ser tyr ile his 

GGT CCT GGG GGT TTG 
gly pro gly gly leu 

CCT GCA AGC CTT AAT 
pro ala ser leu asn 

GGG ACT CCC ACG TCA 
gly thr pro thr ser 

CGC TCC AAT TCC TGG 
arg ser asn ser trp 

AGT CCA AGT GCA GAA 
ser pro ser ala glu 

ACA GAG ACG TCA AGT 
thr glu thr ser ser 

CAC GAG GAG CTG GCT 
his glu glu leu ala 

CAA CAA GCC TGG TTC 
gin gin ala trp phe 

AAT GAT AAA CTT GAG 
asn asp lys leu glu 

GCA GCT CTT GTC AGC 
ala ala leu val ser 

ATG GTT GAG AGA CTC 
met val glu arg leu 

GAC AGA GGA TTT GTT 
asp arg gly phe val 



2191/731 

CAT ACA CAA GAT CCT 
his thr gin asp pro 
2251/751 

CTG TTC CCA GTC CGA 
leu phe pro val arg 
2311/771 

AAG AGC AGT ATT TCA 
lys ser ser ile ser 
2371/791 

CTT CTG CTA GAT AAA 
leu leu leu asp lys 
2431/811 

GTT AAC CTA GGT CAA 
val asn leu gly gin 
2491/831 

AAC TTG GAA GGA AAT 
asn leu glu gly asn 
2551/851 

TAT GTT TTC CGC CTA 
tyr val phe arg leu 
2611/871 

GGA GGA TCA GTG CAT 
gly gly ser val his 
2671/891 

TTA AAT CGT TCT CGA 
leu asn arg ser arg 
2731/911 

CCA GAT GAT GAA GTT 
pro asp asp glu val 
2791/931 

GTT AAC ACT GGT GGT 
val asn thr gly gly 
2851/951 

TCA ACA CAG GCT ATG 
ser thr gin ala met 
2911/971 

TTC TTA CAA ACA TTA 
phe leu gin thr leu 
2971/991 

TTG CAG TGG GTT GTT 
leu gin trp val val 
3031/1011 

TTT TTT GAA TTA ATG 
phe phe glu leu met 
3091/1031 

GCT CCA AGG AAA AGT 
ala pro arg lys ser 
3151/1051 

ACG ATT GCT AGT GAT 
thr ile ala ser asp 
3211/1071 

AAT ACA AGC CTT GCA 
asn thr ser leu ala 
3271/1091 

TTT AGC CTT ATA AAG 
phe ser leu ile lys 



TAT CTT GAC AAA TTT 
tyr leu asp lys phe 

ATT GGG GAC ATG CGA 
ile gly asp met arg 

GCA CTG AAT TCA TCC 
ala leu asn ser ser 

CTG ATA CTT TTA GTT 
leu ile leu leu val 

GCA TCT TTT GAA GCC 
ala ser phe glu ala 

CAT GAC CAG CAT GGC 
his asp gin his gly 

CCA AAT ACT TAC CCT 
pro asn thr tyr pro 

TAT GCC ACA ATG GCT 
tyr ala thr met ala 

AGC CTT AGT AAT AGC 
ser leu ser asn ser 

CGA TCA ATC ATC GGG 
arg ser ile ile gly 

CCA AAA GCT GCC CCA 
pro lys ala ala pro 

GAT CGA AGT TGT AAT 
asp arg ser cys asn 

ACG GGA CGC TTA CCA 
thr gly arg leu pro 

TGC AGT GGC AGC GTT 
cys ser gly ser val 

GTA AAG AGC ATG GTG 
val lys ser met val 

CGT TTT CCA GAA CGT 
arg phe pro glu arg 

ATA GTT TCA CGA TTT 
ile val ser arg phe 

TTC TTT CTC AAT GAT 
phe phe leu asn asp 

TCC TGC TAT AAA CAG 
ser cys tyr lys gin 
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3301/1101 
















GTG TCT TCA 


AAG 


CTT 


TAC 


TCA 


TTA 


CCG 


AAT 


val ser ser 


lys 


leu 


tyr 


ser 


leu 


pro 


asn 


3361/1121 
















TTT CTA CGA 


ATC 


ATC 


TGC 


AGT 


CAT 


GAG 


CAC 


phe leu arg 


ile 


ile 


cys 


ser 


his 


glu 


his 


3421/1141 
















CTT ACT CCA 


CCT 


GCA 


TCT 


CCA 


TCA 


CCT 


TCT 


leu thr pro 


pro 


ala 


ser 


pro 


ser 


pro 


ser 


3481/1161 
















TTT TCT ACQ 


AAT 


GTA 


CAA 


GAC 


CAA 


AAG 


ATT 


phe ser thr 


asn 


val 


gin 


asp 


gin 


lys 


ile 


3541/1181 
















CGC CAA CAG 


CAT 


TAT 


TTG 


GCA 


GGA 


CTT 


GTG 


arg gin gin 


his 


tyr 


leu 


ala 


gly 


leu 


val 


3601/1201 
















GAT GCT GAA 


GGA 


CTG 


TTT 


GGA 


TTG 


CAT 


AAG 


asp ala glu 


9iy 


leu 


phe 


9iy 


leu 


his 


lys 


3661/1221 
















TCC AGT CAC 


GAC 


TCA 


GAC 


CCG 


CGG 


TAC 


TCT 


ser ser his 


asp 


ser 


asp 


pro 


arg 


tyr 


ser 


3721/1241 
















TTG TAT CTA 


CCT 


CTG 


ATT 


GGT 


ATT 


ATC 


ATG 


leu tyr leu 


pro 


leu 


ile 


gly 


ile 


ile 


met 


3781/1261 
















GAA ACT CAC 


AAT 


CAA 


CGA 


GGA 


AGA 


CCA 


ATT 


glu thr his 


asn 


gin 


arg 


gly 


arg 


pro 


ile 


3841/1281 
















AGC GGA AGT 


ATG 


ATA 


AGC 


CAG 


ACC 


GTT 


GCC 


ser gly ser 


met 


ile 


ser 


gin 


thr 


val 


ala 


3901/1301 
















CTA ACA AGG 


CCT 


GGC 


AGT 


TTC 


CTC 


CTC 


ACG 


leu thr arg 


pro 




ser 


phe 


leu 


leu 


thr 


3961/1321 
















TCA GCA GAA 


TCA 


AGT 


CGA 


AGC 


CTT 


TTG 


ATC 


ser ala glu 


ser 


ser 


arg 


ser 


leu 


leu 


ile 


4021/1341 
















GAA ACA GTT 


CTA 


CAG 


AAG 


TGG 


TTT 


ACA 


GAT 


glu thr val 


leu 


gin 


lys 


trp 


phe 


thr 


asp 


4081/1361 
















GAT CTG CTT 


TAT 


CTC 


TGT 


GTG 


TCT 


TGC 


TTT 


asp leu leu 


tyr 


leu 


cys 


val 


ser 


cys 


phe 


4141/1381 
















ATG AAT AGC 


TTG 


ACC 


TTT 


AAG 


AAA 


TCA 


AAA 


met asn ser 


leu 


thr 


phe 


lys 


lys 


ser 


lys 


4201/1401 
















CTT GGG AGC 


ATA 


GGT 


GCC 


AGG 


CAA 


GAA 


ATG 


leu gly ser 


ile 


gly 


ala 


arg 


gin 


glu 


met 


4261/1421 
















AGC CCA TCT 


GGA 


AGT 


GCC 


TTT 


GGA 


AGT 


CAA 


ser pro ser 


gly 


ser 


ala 


phe 


gly 


ser 


gin 


4321/1441 
















CAC TGG CGT 


CAA 


AAC 


ACA 


GAG 


AAG 


CTT 


GAC 


his trp arg 


gin 


asn 


thr 


glu 


lys 


leu 


asp 


4381/1461 
















CTG ATT GAT 


GGA 


AAC 


CTG 


GCT 


ACA 


GAA 


GCA 


leu ile asp 


gly 


asn 


leu 


ala 


thr 


glu 


ala 



3331/1111 
















CCC AGT GTT 


CTG 


GTG 


TCC 


TTG 


AGG 


CTG 


GAT 


pro ser val 


leu 


val 


ser 


leu 


arg 


leu 


asp 


3391/1131 
















TAT GTT ACA 


TTA 


AAC 


TTA 


CCC 


TGC 


AGC 


TTA 


tyr val thr 


leu 


asn 


leu 


pro 


cys 


ser 


leu 


3451/1151 
















GTT TCT TCT 


GCA 


ACA 


TCT 


CAG 


AGT 


TCT 


GGA 


val ser ser 


ala 


thr 


ser 


gin 


ser 


ser 


gly 


3511/1171 
















GCA AAT ATG 


TTT 


GAA 


TTA 


TCC 


GTG 


CCT 


TTC 


ala asn met 


phe 


glu 


leu 


ser 


val 


pro 


phe 


3571/1191 
















TTA ACA GAG 


CTG 


GCT 


GTC 


ATT 


TTA 


GAC 


CCT 


leu thr glu 


leu 


ala 


val 


ile 


leu 


asp 


pro 


3631/1211 
















AAA GTC ATC 


AAT 


ATG 


GTA 


CAC 


AAT 


TTA 


CTC 


lys val ile 


asn 


met 


val 


his 


asn 


leu 


leu 


3691/1231 
















GAC CCT CAG 


ATA 


AAG 


GCT 


CGA 


GTG 


GCC 


ATG 


asp pro gin 


ile 


lys 


ala 


arg 


val 


ala 


met 


3751/1251 
















GAA ACT GTA 


CCT 


CAG 


CTG 


TAT 


GAT 


TTT 


ACA 


glu thr val 


pro 


gin 


leu 


tyr 


asp 


phe 


thr 


3811/1271 
















TGT ATA GCC 


ACT 


GAT 


GAT 


TAT 


GAA 


AGT 


GAG 


cys ile ala 


thr 


asp 


asp 


tyr 


glu 


ser 


glu 


3871/1291 
















ATG GCA ATC 


GCA 


GGG 


ACA 


TCG 


GTC 


CCT 


CAA 


met ala ile 


ala 


gly 


thr 


ser 


val 


pro 


gin 


3931/1311 
















TCA ACG AGT 


GGC 


AGG 


CAA 


CAC 


ACT 


ACC 


TTT 


ser thr ser 


gly 


arg 


gin 


his 


thr 


thr 


phe 


3991/1331 
















TGT CTA CTT 


TGG 


GTT 


CTC 


AAA 


AAT 


GCA 


GAT 


cys leu leu 


trp 


val 


leu 


lys 


asn 


ala 


asp 


4051/1351 
















CTC TCA GTC 


TTG 


CAG 


CTA 


AAC 


CGG 


CTA 


TTA 


leu ser val 


leu 


gin 


leu 


asn 


arg 


leu 


leu 


4111/1371 
















GAG TAT AAA 


GGG 


AAA 


AAA 


GTG 


TTT 


GAA 


CGA 


glu tyr lys 


gly 


lys 


lys 


val 


phe 


glu 


arg 


4171/1391 
















GAC ATG AGA 


GCA 


AAG 


CTT 


GAA 


GAA 


GCT 


ATT 


asp met arg 


ala 


lys 


leu 


glu 


glu 


ala 


ile 


4231/1411 
















GTA CGG CGA 


AGC 


CGA 


GGA 


CAG 


CTC 


GAG 


AGA 


val arg arg 


ser 


arg 


gly 


gin 


leu 


glu 


arg 


4291/1431 
















GAA AAT TTG 


AGG 


TGG 


AGG 


AAA 


GAT 


ATG 


ACT 


glu asn leu 


arg 


trp 


arg 


lys 


asp 


met 


thr 


4351/1451 
















AAA TCA AGA 


GCA 


GAG 


ATT 


GAA 


CAC 


GAA 


GCA 


lys ser arg 


ala 


glu 


ile 


glu 


his 


glu 


ala 


4411/1471 
















AAC CTA ATC 


ATT 


TTA 


GAT 


ACA 


TTA 


GAG 


ATT 


asn leu ile 


ile 


leu 


asp 


thr 


leu 


glu 


ile 



FIG. 6A (4 of 6) 



4441/1481 








GTT GTT CAG 


ACC 


GTT 


TCT 


val val gin 


thr 


val 


ser 


4501/1501 








GTG CTA CTA 


CAC 


AGC 


ATG 


val leu leu 


his 


ser 


met 


4561/1521 








ACA CAG AGA 


GCC 


TTG 


GTT 


thr gin arg 


ala 


leu 


val 


4621/1541 








TGT GCT GAT 


TTA 


TGC 


CTC 


cys ala asp 


leu 


cys 


leu 


4681/1561 








TCA CAC CCC 


AGT 


GCC 


TCC 


ser his pro 


ser 


ala 


ser 


4741/1581 








TTT GCC AGG 


GTT 


AAA 


ATG 


phe ala arg 


val 


lys 


met 


4801/1601 








AAT TTT AAT 


GAA 


GAA 


TTC 


asn phe asn 


glu 


glu 


phe 


4861/1621 








GAT CTG GAA 


TTG 


AGG 


GAA 


asp leu glu 


leu 


arg 


glu 


4921/1641 








CAT ATG ATT 


CTT 


TCT 


GAT 


his met ile 


leu 


ser 


asp 


4981/1661 








ATT GAT CTA 


ATG 


TAC 


AGA 


ile asp leu 


met 


tyr 


arg 


5041/1681 








TTG CAG AAC 


ATG 


GCA 


GGC 


leu gin asn 


met 


ala 


gly 


5101/1701 








CTA GTC CAC 


TCA 


GCA 


GCA 


leu val his 


ser 


ala 


ala 


5161/1721 








CTT CCT GTG 


GGA 


TGT 


GTA 


leu pro val 


gly 


cys 


val 


5221/1741 








GTC TCA GAT 


GAT 


GTG 


GTA 


val ser asp 


asp 


val 


val 


5281/1761 








GAG TCA GGA 


CTT 


GTG 


GGA 


glu ser gly 


leu 


val 


gly 


5341/1781 








TAT GAA GCA 


GTT 


AAT 


GAA 


tyr glu ala 


val 


asn 


glu 


5401/1801 








GCA AAG AAA 


CTA 


TCC 


ACA 


ala lys lys 


leu 


ser 


thr 


5461/1821 








CAG AGT ACT 


GGC 


TGG 


GAG 


gin ser thr 


gly 


trp 


glu 


5521/1841 








AAG TTC GGG 


GAT 


TTG 


GAT 


lys phe gly 


asp 


leu 


asp 



4471/1491 
GTA ACG GAA TCC AAA GAG AGC 
val thr glu ser lys glu ser 

4531/1511 
GCC TGT AAC CAA AGT GCA GTT 
ala cys asn gin ser ala val 

4591/1531 
TCA AAG TTT CCT GAA CTC TTA 
ser lys phe pro glu leu leu 

4651/1551 
AGG CTT CTC CGA CAC TGT AGC 
arg leu leu arg his cys ser 

4711/1571 
CTT TAC CTA CTA ATG AGG CAA 
leu tyr leu leu met arg gin 

4771/1591 
CAG GTA CCA ATG TCA CTA TCC 
gin val pro met ser leu ser 

4831/1611 
TTA AGA CGT TCT CTA AAG ACT 
leu arg arg ser leu lys thr 

4891/1^31 
ACA ACA TTT CCT GAT CAG GTC 
thr thr phe pro asp gin val 

4951/1651 
ACT GTG AAA ATG AAG GAA CAC 
thr val lys met lys glu his 

5011/1671 
ATT GCC AAG GGT TAC CAG ACC 
ile ala lys gly tyr gin thr 

5071/1691 
AAG CAC TCA GAA CGA AGC AAT 
lys his ser glu arg ser asn 

5131/1711 
CTT GTT GCT GAA TAT TTG AGC 
leu val ala glu tyr leu ser 

5191/1731 
ACA TTT CAG AAT ATT TCA TCT 
thr phe gin asn ile ser ser 

5251/1751 
TCT CCA GAT GAA GAA GGT ATC 
ser pro asp glu glu gly ile 

5311/1771 
TTA CTG GAA CAA GCA GCT GCT 
leu leu glu gin ala ala ala 

5371/1791 
GTT TAC AAA GTA CTT ATT CCT 
val tyr lys val leu ile pro 

5431/1811 
ATT CAT GGT AAA CTT CAA GAA 
ile his gly lys leu gin glu 

5491/1831 
CGG ATG TTT GGC ACC TAT TTT 
arg met phe gly thr tyr phe 

5551/1851 
GAA CAA GAA TTT GTT TAC AAG 
glu gin glu phe val tyr lys 



ATT CTT GGT GGA GTG CTA AAA 
ile leu gly gly val leu lys 

TAT CTA CAA CAC TGT TTT GCT 
tyr leu gin his cys phe ala 

TTT GAA GAA GAG ACA GAG CAG 
phe glu glu glu thr glu gin 

AGT AGC ATC GGT ACA ATA CGG 
ser ser ile gly thr ile arg 

AAC TTT GAG ATT GGG AAT AAC 
asn phe glu ile gly asn asn 

TCC TTG GTG GGC ACA TCT CAG 
ser leu val gly thr ser gin 

ATA TTG ACA TAT GCT GAA GAA 
ile leu thr tyr ala glu glu 

CAG GAT CTG GTT TTC AAT CTC 
gin asp leu val phe asn leu 

CAG GAG GAT CCT GAA ATG TTG 
gin glu asp pro glu met leu 

TCT CCA GAG CGA TTG ACC TGG 
ser pro glu arg leu thr trp 

CAT GCT GAA GCT GCA CAG TGT 
his ala glu a.la ala gin cys 

ATG CTG GAG GAC CGG AAA TAT 
met leu glu asp arg lys tyr 

AAT GTT TTA GAA GAA TCT GCG 
asn val leu glu glu ser ala 

TGC TCT GGA AAA TAC TTT ACT 
cys ser gly lys tyr phe thr 

TCC TTC TCT ATG GCT GGC ATG 
ser phe ser met ala gly met 

ATT CAT GAA GCT AAT CGG GAT 
ile his glu ala asn arg asp 

GCA TTC AGC AAA ATT GTT CAT 
ala phe ser lys ile val his 

CGT GTT GGT TTT TAT GGA ACC 
arg val gly phe tyr gly thr 

GAG CCT GCA ATA ACC AAA CTT 
glu pro ala ile thr lys leu 
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5581/1861 
















5611/1871 
















GCA GAG ATA 


TCT 


CAC 


AGA 


TTG 


GAG 


GGA 


TTT 


TAC GGA GAA 


AGA 


TTT 


GGA 


GAG 


GAT 


GTG 


GTT 


ala glu ile 


ser 


his 


arg 


leu 


glu 




phe 


tyr gly glu 


arq 


phe 


qly 


qIu 


asp 


val 


val 


5641/1881 
















5671/1891 
















GAA GTA ATC 


AAA 


GAC 


TCT 


AAT 


CCT 


GTA 


GAC 


AAG TGT AAA 


TTA 


GAT 


CCT 


AAC 


AAG 


GCA 


TAT 


glu val ile 


lys 


asp 


ser 


asn 


pro 


val 


asp 


lys cys lys 


leu 


asp 


pro 


asn 


Ivs 


ala 


tvr 


5701/1901 
















5731/1911 
















ATT CAG ATT 


ACC 


TAT 


GTG 


GAG 


CCA 


TAC 


TTT 


GAC ACA TAT 


GAG 


ATG 


AAG 


GAC 


AGA 


ATC 


ACC 


ile gin ile 


thr 


tyr 


val 


glu 


pro 


tyr 


phe 


asp thr tyr 


glu 


met 


lys 


asp 


ara 


ile 


thr 


5761/1921 
















5791/1931 
















TAT TTC GAC 


AAA 


AAT 


TAC 


AAT 


CTT 


CGT 


CGA 


TTC ATG TAC 


TGT 


ACA 


CCC 


TTT 


ACT 


TTA 


GAT 


tyr phe asp 


lys 


asn 


tyr 


asn 


leu 


arg 
— > 


arg 


phe met tyr 


cys 


thr 


pro 


phe 


thr 


leu 


asp 


5821/1941 
















5851/1951 
















GGC CGT GCC 


CAT 


GGG 


GAA 


CTT 


GAT 


GAA 


CAA 


TTC AAA AGG 


AAG 


ACC 


ATT 


CTG 


ACT 


ACG 


TCT 


gly arg ala 


his 


gly 


glu 


leu 


his 


glu 


gin 


phe lys arg 


lys 


thr 


ile 


leu 


thr 


thr 


ser 


5881/1961 
















5911/1971 
















CAT GCC TTT 


CCT 


TAT 


ATT 


AAA 


ACA 


AGG 


GTC 


AAT GTC ACT 


CAT 


AAA 


GAA 


GAG 


ATC 


ATC 


TTA 


his ala phe 


pro 


tyr 


ile 


lys 


thr 


arg 


val 


asn val thr 


his 


Ivs 


qIu 


qIu 


ile 


ile 


leu 


5941/1981 
















5971/1991 
















ACA CCA ATT 


GAA 


GTT 


GCT 


ATT 


GAG 


GAC 


ATG 


CAG AAA AAG 


ACA 


CAG 


GAG 


TTG 


GCA 


TTT 


GCA 


thr pro ile 


glu 


val 


ala 


ile 


glu 


asp 


met 


gin lys lys 


thr 


gin 


qIu 


leu 


ala 


phe 


ala 


6001/2001 
















6031/2011 
















ACA CAT CAG 


GAT 


CCC 


GCA 


GAC 


CCC 


AAA 


ATG 


CTT CAG ATG 


GTA 


CTC 


CAG 


GGA 


TCT 


GTA 


GGC 


thr his gin 


asp 


pro 


ala 


asp 


pro 


lys 


met 


leu gin met 


val 


leu 


gin 


qIv 


ser 


val 


q1 V 


6061/2021 
















6091/2031 
















ACC ACA GTG 


AAT 


CAG 


GGG 


CCT 


TTG 


GAA 


GTT 


GCC CAG GTT 


TTT 


CTG 


TCT 


GAA 


ATA 


CCT 


AGT 


thr thr val 


asn 


gin 


gly 


pro 


leu 


glu 


val 


ala gin val 


phe 


leu 


ser 


glu 


ile 


pro 


ser 


6121/2041 
















6151/2051 
















GAC CCA AAG 


CTC 


TTC 


AGA 


CAT 


CAT 


AAT 


AAA 


CTG CGA CTC 


TGC 


TTT 


^^^^^^ 


GAT 


TTT 


ACT 


AAA 


asp pro lys 


leu 


phe 


arg 


his 


his 


asn 


lys 


leu arg leu 


cys 


phe 


lys 


asp 


phe 


thr 


lys 


6181/2061 
















6211/2071 
















AGG TGT GAA 


GAT 


GCC 


TTA 


AGA 


AAA 


AAT 


AAG 


AGC TTA ATT 


GGG 


CCG 


GTT 


CAA 


AAG 


GAG 


TAT 


arg cys glu 


asp 


ala 


leu 


arg 


lys 


asn 


lys 


ser leu ile 


gly 


pro 


val 


gin 


lys 


glu 


tyr 


6241/2081 
















6271/2091 
















CAA AGG GAA 


TTG 


GGG 


AAA 


CTA 


TCT 


TCG 


CCT 


TAA 
















gin arg glu 


leu 


gly 


lys 


leu 


ser 


ser 


pro 


OCH 

















AGAGGCCCTACAGCCCTAGATCACAGAAAGTCCCTCAGTTATCCAAGCCAGTATTGCTTGTCCCCTGCCACAGAGATTCC 
TTCAGTCGAATGAGCTTTCGCAAAATGGATCTCTAAACTGAATGCACTTGTTTTATTCATCTGCAAAGAGCCATGTATTC 
AACATCGAGTGTGAAAAGATCTATTGGAAACCAACATGGAATGGAATTCTGGAAATTATTATTCATTGAAGAATGCAGTG 
GCCAAGAAAATATCAAATGTAGATTGTTAACGCTTGAGAATCATGGCTATGGTTTCTAATGTTCGGGTAACAAGCTGTTA 
TCTTTTAAGACATTTTAATGACTCAAAGGTACACTATACATTTACCATTATTTATACCATAGCTAAGGTTAAAAATTTAT 
TCACTTTAAGTTCGTATTTTTTAATTTATATCACCATTTATAGATTCATTTTGGACCCATTTTAAATGTAGTAATGCTTA 
TTTTAAAGGTACTAAAAAATATGTGAATGTTTACCTCGTGCGCGCCAGGGCCTC 
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Allelic Variations: single nucleotide changes (polymorphisms) between hCLASP-3 
cDNAs and/or genomic sequences 



Isoform 


Difference 


Nucleotide 
position 


Consequence 


1 


polymorphism 


318 


A to G; missense 


2 


polymorphism 


323 


AtoG;Glu to Gly 


3 


polymorphism 


2187 


T to C; missense 


4 


polymorphism 


3165 


T to G; Asp to Glu 










Allelic Variations: Alternative Exon usage 


Isoform 


Difference 


Nucleotide 
position 


Consequence 


1 


Exon deletion 


2768-2860, 
inclusive 


In frame deletion — 33 amino 
acids removed 











FIG. 6B 



31 n 



(LI 



I 



to 

UH 



00 



FIG. 6C 



Exon 60270 - 20370 

CGCCGCAGCCGCCGCCGCC GTCGCCGTCGCCGCAGCAGCCATGGCCGAGCGCCGCGCCTTCGCCCAGAAGATCAGCAG GT 
AAATATCCGGCGTGGGGCGC 



Exon 85360 - 85510 

GTTTTTGCTTTCTTCATTGTA GAACGGTGGCAGCCGAAGTTAGGAAGCAGATCTCCGGACAATATAGTGGTTCTCCCCAA 
CTGCTCAAAAACCTTAATATTGTTGGCAATATATCCCATCACACCACAGTAAGTAACGTATTCAAAATATA 



Exon 94500 - 94720 

TCTTATCCCAACTTTTTACAAAGGT GCCCCTTACCGAAGCAGTAGATCCAGTGGATTTGGAAGATTACCTCATTACTCAT 
CCTTTGGCTGTGGATTCTGGGCCTTTACGGGATTTGATTGAATTTCCTCCAGATGATATTGAAGTTGTTTATAGTCCTCG 
GGACTGCAGAACTCTTGTTTCAGCTGTACCTGAAGAAAGGTAAGGAGACATTGACTTATT 



Exon 94870 - 94980 

TATTTTCCTTTTTAAAATAG TGAAATGGATCCACATGTTAGAGACTGTATAAGAAGTTATACAGAAGACTQGGCAATTGT 
CATCAGAAAGTAAGTTATATGTTTATTACAA 



Exon 100110 - 100290 

ATTTATTTAACTTTTTTTCTTTAATAG ATATCATAAATTGGGAACAGGATTTAATCCCAATACATTAGATAAACAGAAAG 
AAAGGCAAAAAGGTTTGCCAAAACAAGTTTTTGAATCTGATGAAGCTCCAGATGGCAACAGCTACCAGGATGATCAAG TA 
ATACTTTTATTCTTAAATAA 



Exon 100340 - 100600 

ATATTTAATGTTTTGCATGACAGGATGACC TTAAAAGACGTTCAATGTCAATAGATGATACCCCAAGGGGTAGCTGGGCC 
TGTAGTATCTTTGACTTGAAAAATTCACTTCCTGATGCTTTGCTTCCCAATTTACTTGATCGAACTCCAAATGAAGAAAT 
AGACCGTCAGAATGATGACCAAAGGAAATCAAACCGTCACAAAGAACTTTTTGCTTTGCATCCATCACCAGATGAGG TAT 
AGATGTTTGCATATAAAGAA 




Exon 100880-101020 

TTTTGGTGTTGCTTTTCAATTTGTAGG AAGAACCAATAGAACGGCTTAGTGTTCCTGATATACCCAAAGAACATTTTGGT 
CAAAGACTTCTTGTAAAATGCTTATCACTCAAGTGAGTATTTATTTCTTTTACTTACAACT 



Exon 112010 - 112120 

TTTTTCTTCATAAAGGT TTGAAATTGAAATTGAACCCATTTTTGCAAGTTTGGCTTTATATGATGTCAAGGAAAAGAAAA 
AGGTAAGATTATATAATTTGACCATAGTTAT 



Exon 113680 - 113880 

AAGTTTAACATACTAATATTTTTTAG ATTTCAGAAAACTTTTATTTTGACCTTAATTCTGAGCAGATGAAAGGGTTGTTA 
CGTCCACATGTACCACCTGCTGCCATTACTACCCTGGCAAGATCAGCAATTTTTTCTATCACTTATCCTTCCCAAGATGT 
TTTTCTTGTAATAAAGGTGAGAATAATGTTAAATATATTTG 



Exon 115020-115160 

TTAATCTTAACTTTTTTTGCCTTTGACAG CTAGAAAAAGTCCTACAGCAAGGAGACATTGGAGAGTGTGCAGAACCATAT 
ATGATTTTCAAAGAAGCAGATGCCACCAAGGTAGAATGTTATGCTTCTCATTTCCGCCAC 



Exon 117200 — 117410 

ATGTATAAAGTTCTGTTTTGCAG AATAAAGAAAAACTGGAGAAACTGAAGAGTCAAGCAGATCAGTTTTGCCAAAGACTT 
GGGAAATATCGCATGCCTTTTGCTTGGACTGCAATCCATTTAATGAATATTGTTAGCAGTGCTGGGAGTTTGGAAAGAGA 
TTCTACAGAAGTAGAAATCAGTACTGGAGGTAAGAGTGTTTCATACAAAAC 



Exon 123200 - 123396 

AAAATGAATTTTTTTTTTAATTCTTTTGTAG AACGAAAAGGGTCTTGGTCAGAGAGGAGGAATTCTAGTATTGTTGGCAG 
ACGATCACTTGAAAGGACAACAAGTGGAGATGATGCTTGTAACTTGACGAGCTTTCGACCAGCTACTCTCACAGTGACAA 
ATTTTTTTAAGCAGGTATTGTTCTGTCATGTAGGAATTTT 




(Next part of CIjASP, starting 

GAAGGAGACCGCTTAAGTGATGAAGATCTCTACAAATTCCTTGCTGATATGAGAAGGCCATCTTCTGTCTTACGGCGACT 
AAGACCTATTACAG ) 

Exon 5560 - 5710 

CTTTTTCCTCTATTATTGAAATCAGGAAGGAGACCGCTTAAGTGATGAAGATCTCTACAAATTCCTTGCTGATATGAGAA 
GGCCATCTTCTGTCTTACGGCGACTAAGACCTATTACAGGTATTTAAAAATTTTGAGTAGAAATGGTTGCA 



Exon 6680-6900 

TTACATTGTTTTTTAATATATAATTTGCAG CTCAGCTCAAGATAGACATTTCTCCCGCACCTGAAAATCCCCATTATTGC 
CTAACTCCGGAGCTGCTTCAAGTGAAGCTTTACCCTGACAGTAGAGTTAGACCTACCAGAGAAATCTTAGAGTTTCCCGC 
AAGGGATGTTTATGTTCCAAACACTACTTACAGGTAAGAGATTTTAATTTGGAGAATTCTG 



Exon 38920 - 39075 

GTATTTACTATCATCTATACTGTTGCTTTCACAG AAATCTTCTCTACATATACCCTCAGAGTCTTAATTTTGCCAATCGT 
CAAGGTTCTGCTAGAAATATAACAGTGAAAGTCCAGTTTATGTATGGAGAGGATCCAAGCAATGCCATGCCGGTAA 




GTAAGCTGCAGAGTACATGCAAAGTCCTTTCAGACACTTTGGCAACTAGTGAGTCATGT 
ATACAAAGAACTATTCAAAGCAAAATGTGGCCAGTTCTCTAAGAGAGTAGTGAAAAGAT 
GTTTGGAGAACAGAGATAAAGGCATGCAACATCTGGCTGGGAGTGTGAGAGCAGACAAC 
CAATTCAGGGACTATCAAACACCCAGGACAAGCTTAAGTACAGGATGAATAATGACAGT 
AACAGTGATAATAATAACAACAGCTGACACACATCGTACTTACCATGAACCAGGCCCTG 
TTCTAAGAACTTTATATTAATCAATTCAATAAATCTTCACAACACCCTTTCAAGTAGAT 
ATAATTATTTTCCTCCTTTTAGAGATGAAGTTAAGTAGCTTGCCTAAGGTTACAGAGTA 
AGAAAGTGGCAAAATAAGTCTGGGTTCCTATCACAGAGGGTCTTAAAAGCCAGGTACAG 
TGATAGCCTTGACTTACAGTCTGTGAGTACCTGGTTCATCTTTCCCAATGGACTCAAAC 
ATTCTGAGAGCAGGTCTGTGTCTAGTTCACCTTCGTGTTTTTTAGAACACCTAGGTGGG 
CTCTATACTTAGGTAGAAGCCAAATATCTGTCCTCTTTGAATAGACTATCTTTTCCTTT 
AGTAAACAAGCATCTGAAAACAGTGTGTTACTGGAGGGCGCCTTGGCAAGAGCCTGTGG 
TACTAGCTACTCCTCCAGAGGCTGAGGCAAGAGGATTGCTTGAGCCTAGGAGTTTTAGG 
CTGCAGTGCCCTATGCATTCCATGCTTGAGCAACACAGGGGGATTCTGTCTAAAATAAA 
TAAATCAATAAATAAAAGCAGTGTGTTACTGCAATGAGCATTGTTATATAAATACACAG 
TCTTCCACAGAAGTTACT^GCACTAGACAGGAAGTTATGGATCTGACTAATTGAGGCTTT 
CACCTCTGACTCAGTGTGTGACTCTGGCAAGTGACCTAATGGCTCCAAGCTACAGATTT 
TCCAATGGTAAAAGAGAGGAAAACAGGTATCCAACATATTCCACCAGGACATACTGAGA 
ATATAATAATAACAATAAATAAATACACATTTTAAGTTCTTGCAATACTTAAGAGGAAA 
GGTACTACAAGAAACCAAGATATTTTTCTTAGTGTCACCATAATTTCAGTATCAACAGA 
TATGCATTAGGAGCCAAATATTCTAGACATTATGGGATTACAACAGAACAGATGAAAAC 
AAATCTCTGCTAGTCTCTGTTTTATAGGCTATTATGGACAAGGTCAGTTTCAGTGGTCA 
ACACAGAGCTTATTTAACAGAACATCAAAATGGGGGTGAACGTTAGAGCTCACCTGGAT 
TTTATATATATATATATATATATATATATATATATATATATATGTATGAGAGACAGAGA 
GAGATCCAAAAGGGGTTAATAATACGGAAACAACTGTCCTCAAAGAAAAGAACCGGCTC 
TGGGTTTGAGTCTCAGTGGAAGAGTTTACCATCTGTGCGACCTGGGGCCAGTCACTTAG 
CTTTTCCAAGCCAATTTCCTTCCCTCTAAAATAGTGATAGTAACGATCTACTTCAAAGC 
GCTCGTGCTTGAAAAACTTTAATCCAGTGGCTGGCTAAGCACCAGAAGTCAACTATTAC 
TATTGTTATCGCTGCAGGAGCAAGGTATTTTCTGGCTCTTTCTCTATAAGAAAAACCAC 
TGTTTCTCCAGTTAGGCTGCATTCATGAGGCCGAAGCAGAAACCCAAGTGCTTTAAAAA 
GCTCGGAGACCGGTGCTGCAGAAACATGAACCCAGTCATCCAGGGCTTTGGTTAAAGCA 
* CAGCCCCCTCGGCAAAGCCCACCTTTCATTTCCTTCCCTTCCATTTCGGGAAAGCCAAG 
CTCCCCCTACGCGTTCGGTTATCTTATTTTCTTGCCTCGCCAGGTCGCTGGCCTGTCTC 
TGGCTTCTCTCCGCGACCTCTCGGTCGTGCAGGCTCCGCGGCAGCGAAGCGGCTGGGGC 
TGGCGCCTCGCTTCCTCAGCGCTCCATTCTTCCCCTCGGCTCCCGCCGGCCGCAGCCGC 
CTTCCGCAGCCGGGGTTCCCGCCGGGATTGACGCGCTGGGGGAGGAGCGGTTTCTCGTT 
GCGCGCCTCTAAGGAACATTACGGCAGGGCTCGTTCCTGGCTCCGGCCGCCAGCCCCAG 
CCTCCCAGGTCCGGAGCCCGGACTGGCGGAGGCCGCGAGGGAGGGAGCACGAGCGAGGA 
GGCACGCGCCCGCCGGTCCGCACCCGGCCCGCCGCAGCCGCCGCCGCC GTCGCCGTCGC 
CGCAGCAGCCATGGCCGAGCGCCGCGCCTTCGCCCAGAAGATCAGCAG 
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MFPMED I S I S VIGRQRRTVQ 20 

. MTHLNSLDVQIiAQELG 16 

MAERRAFAQKISRTVAAEVRKQISGQYSGSPQLLKNLNIVG 4 1 

MLLFPYDDFQTAILRRQGRYICS 23 

MAASERRAFAHKINRTVAAEVRKQVSRERSGSPHSSRRCSSSL 4 3 

MSFRGKVFKREPSEFWKKRRTVRRVIQEEFHRFSSQEKPRLLEPLDYETVIEELEKTYRN 6 0 



hCLASP4 STVPEDAEKRAQSLFVKECIKTYSTDWHWNYK 53 

hCLASPS DFT 19 

hCLASP3 N ISHHTTVPLTEAVDPVDLEDYLITHPLAVDSGPLRDLIEFP 83 

hCIiASP2 TVPAKAEEEAQSLFVTECIKTYNSDWHLVNYK 55 

hCLASP? G VPLTEWEPLDFEDVLLSRPPDAEPGPLRDLVEFP 7 9 



hCLASPl DPLQDLLFFPSDDFSAATVSWDIRTLYSTVPEDAEHKAENLIiVKEACKFYSSQWHWNYK 12 0 
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YEDFSGDFRMLPCKSLRPEKIPNHVFEIDEDCEKDED SSSLCSQKGGVIKQG 105 

DDDLDWFTPKECRTLQP-SLPEEGVELDPHVR DCVQTYIREWLI 63 

PDD I E WYS PRDCRTLVS - AVPE E - S EMD PHVR DC I RS YTEDWA I 12 6 

YEDYSGEFRQLPNKWKLDKLPVHVYEVDEEVDKDED AASLGSQKGGITKHG 107 

ADDLELLLQPRECRTTEP - GI PKD - EKLDAQVR AAVEMYIEDWVI 122 

YEQYSGDIRQLPRAEYKPEKLPSHSFEIDHEDADKDEDTTSHSSSKGGGGAGGTGVFKSG 18 0 
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hCLASP2 
hCLASP7 
hCLASPl 



WLHKANVNSTIT- - VTMKVFKRRYFYLTQLPDGSYILNSYKDEKNSKESK-GCIYLDACI 162 
VNRKNQGSPEIC- -GFKKTGSRKDFHKT-LPKQTFESETLECSEPAAQA- -GPRHLNVLC 118 
VIRKYHKLGTGF- -NPNTLDKQKERQKG- LPKQVFESDEAPDGNSYQDDQDDLKRRSMS I 183 
WLYKGNMNSAIS- -VTMRSFKRRFFHLIQLGDGSYNLNFYKDEKISKEPK-GSIFLDSCM 164 
VHRRYQYLSAAY- -SPVTTDTQRERQKG-LPRQVFEQDASGDERSGPEDSNDSRRGSGSP 179 
WLYKGNFNSTVNNTVTVRSFKKRYFQLTQLPDNSYIMNFYKDEKISKEPK-GCIFLDSCT 23 9 
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DWQCPKMRRHAFELKMLDKYSHYLAAETEQEMEEWLITLKKIIQINTDSLVQEKKETVE 222 

DVSGKGPVTACDFDLRSLQPDKRLENLLQQVSAEDFEKQNEEARRTN RQAE 16 9 

DDTPRGSWACSIFDLKNSLPDALLPNLLDRTPNEEIDRQNDDQRKSN RHKE 2 34 

GWQNNKVRRFAFELKMQDKSSYLLAADSEVEMEEWITILNKILQLN FEAAMQEK 219 

EDTPRS SGASS I FDLRNLAADSLLPSLLERAAPEDVDRRNETLRRQH RPPA 23 0 

GWQNNRLRKYAFELKMNDLTYFVLAAETESDMDEWIHTLNRILQISPEGPLQGRRSTEL 2 99 

* - * . 
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TAQDDETSS QGKAENIMASLERSMHPELMKYGRETEQLNKLSRGDGRQNLFSFDSE 2 78 

LF AL YP S VD EEDAVE I RP VPE C PKEHLG N RI LVKLLTLKFE IE 212 

LFALHPSPD EEEPIERLSVPDIPKEHFG QRLLVKCLSLKFEIE 2 77 

RNGDSHEDD EQSKLEGSGSGLDSYLPELAKSAREAEIK LKSESRVKLFYLDPD 2 72 

LLTLYPAPD EDE AVERCS RPE P PREHFG QRI L VKCLSLKFE IE 2 73 

TDLGLDSLDNSVTCECTPEETDSSENNLHADFAKYLTETEDTVKTTRNMERLNLFSLDPD 3 5 9 
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VQRLDFS GIEPDIKP-FEEKCNKRFLVNCHDLTFNILGQIGDNAKGPPTNVEPFFI 333 

lEPLFAS lALYDVKERKKISENFHCDLNSDQFKGFLRAHTPSVAASSQARSAVFSV 2 68 

I E P I FAS LALYDVKEKKKI SENFYFDLNSEQMKGLLRPHVPPAAI TTLARSAI FS I 3 33 

AQKLDFS SAEPEVKS - FEEKFGKRI LVKCNDLSFNLQCCVAENEEGPTTNVEPFFV 32 7 

lEPIFGI LALYDVREKKKISENFYFDLNSDSMKGLLRAHGTHPAISTLARSAIFSV 329 

IDTLKLQKKDLLEPESVIKPFEEKAAKRIMIICKALNSNLQGCVTENENDPITNIEPFFV 419 
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NLALFDVKNNCKI SADFHVDLNPPSVREMLWGSSTQLASDGS P KGSSPESYIHGI AE 3 90 

TYPSSDIYLWKIEKVLQQGD IGDCAEPYTVIKESDG GKSKE-KIEKLKL 31? 

TYPSQDVFLVI KLEKVLQQGD 1 GECAEP YMI FKEADA TKNKE - KLEKLKS 3 82 

TLSLFDIKYNRKISADFHVDLNHFSVRQMLATTSPALMNGS GQSPSVLKGILHE 3 81 

TYPS PD I FLVI KLEKVLQQGD 1 SECCE P YMVLKEVDT AKNKE - KLEKLRL 3 ? 8 

SVALYDLRDSRKISADFHVDLNHAAVRQMLLGASVALENGNIDTITPRQSEEPHIKGLPE 4 ? 9 

*•* ••••• * • 

SQLRYIQQGIFSVTNPHPEIFLVARIEKVLQGNITHCAEPYIKNSDPVKTAQKVHRTAKQ 450 

QAESFCQR LGKYRMPFAWAPISLSSFFNVSTLEREVTDVDSWGRSPVGERRTLA 3 72 

QADQFCQR LGKYRMPFAWTAIHLMNIVSSAGSLERDSTEVEISTGERKGSWSERR 4 3 7 

AAMQYPKQGIFSVTCPHPDIFLVARIEKVLQGSITHCAEPYMKSSDSSKVAQKVLKNAKQ 441 

AAEQFCTR LGRYRMPFAWTAVHLANIVSSAGQLDRDSD SEGERRPAWTDRR 42 9 

EWLKFPKQAVFSVSNPHSEIVLVAKIEKVLMGNIASGAEPYIKNPDSNKYAQKILKSNRQ 53 9 



VCSRLGQYRMPFAWAARPIFKDTQGSLDLDGRFSPLYKQDSSKLSSEDILKLLSEYKKPE 510 

QSRRLSERALSLEENGVGSNFKTS TLSVSSFFKQEGDRLSDEDLFKFLADYKRSS 427 

NSSIVGRRSLERTTSGDDACNLTSFR-PATLTVTNFFKQEGDRLSDEDLYKFLADMRRPS 4 96 

ACQRLGQYRMPFAWAARTL FKDASGNLDKNAR FSAI YRQDSNKLSNDDMLKLLADFRKPE 501 

RRGPQ--DRASSGDDACSFSGFR-PATLTVTNFFKQEAERLSDEDLFKFLADMRRPS 4 83 

FCSKLGKYRRAFAWAVRSVFKDNQGNVDRDSRFSPLFRQESSKISTEDLVKLVSDYRRAD 5 99 



* . 



^ • * • • 



- -KTKLQIIPGQLNITVECVPVDLSNCITSSYVPLKPFE-KNCQNITVEVEEFVPEMTKY 567 
SLQRRVKSIPGLLRLEISTAPEIINCCLTPEMLPVKPFP-ENRTRPHKEILEFP- -TREV 4 84 
SVLRRLRPITAQLKIDISPAPENPHYCLTPELLQVKLYP-DSRVRPTREILEFP- -ARDV 553 
K-MAKLPVILGNLDITIDNVSSDFPNYVNSSYIPTKQFETCSKTPITFEVEEFVPCIPKH 560 
S LLRRLRPVTAQL KI D I S PAPENPHF CLS PELLH I KP YP - D PRGRPTKE I LE FP - - ARE V 54 0 
R- ISKMQTIPGSLDIAVDNVPLEHPNCVTSSFIPVKPFNMMAQTEPTVEVEEFVYDSTKY 658 



CYPFTIYKNHLYVYPLQLKYDSQKTFAKARNIAVCVEFRDSDESDASALKCIYGKPAGSV 
YVPHTVYRNLLYVYPQRLNFVN- -ICLASARNITIKIQFMCG-EDASNAMPVIFGKS 3GPE 
YVPNTTYRNLLYIYPQSLNFAN- -RQGSARNITVKVQFMYG-EDPSNAMPVIFGKS 3CSE 
TQPYTIYTNHLYVYPKYLKYDSQKSFAKARNIAICIEFKDSDEEDSQPLKCIYGRP 3GPV 
YAPHTSYRNLLYVYPHSLNFSS- -RQGSVRNLAVRVQYMTG-EDPSQALPVIFGKS 3CSE 
CRPYRVYKNQIYIYPKHLKYDSQKCFNKARNITVCIEFKNSDEESAKPLKCIYGKP 3GPL 

* * it . -k . * * *.. ★* it , , 



FTTO AYAWSHHNQNPEFYDEIKIELPIHLHQKHHLLFTFYHVSCEINTKGTTKKQDT^E 

FLQE VYTAVTYHNKSPDFYEEVKIKLPAKLTVNHHLLFTFYHISCQQ KQGAS VE 

FSKEAYTAWYHNRSPDFHEEIKVKLPATLTDHHHLLFTFYHVSCQQ KQNTPLE 

FTRSAFAAVLHHHQNPEFYDEIKIELPTQLHEKHHLLLTFFHVSCDNSSKGSTKKRDVVE 

FTREAFTPWYHNKSPEFYEEFKLHLPACVTENHHLLFTFYHVSCQP RPGTALE 

FTSAAYTAVLHHSQNPDFSDEVKIELPTQLHEKHHILFSFYHVTCDINAKANAKKKEALE 



* . * 



* * . * * 



* * • * 



TPVGFAWVPLLKDGRI I TFEQQLPVS ?\NLPPGYLNLNDAESRRQCNVD I KWVDGAKPLLK 
TLLGYSWLPILLNERLQTGSYCLPVALEKLPPNYSMHSAEKVPLQNPPIKWAEGHKGVFN 

TPVGYTWI PMLQNGRLKTGQFCLPVS LEKPPQAYS VLS PEVP LPG ^IKWVDNHKGVFN 

TQVGYSWLPLLKDGRWTSEQHIPVSANLPSGHLGYQELGMGRHYGPEIKWVDGGKPLLK 

TPVGFTWIPLLQHGRLRTGPFCLPVSVDQPPPSYSVLTPDVA LPG^^RWVDGHKGVFS 

TSVGYAWLPLMKHDQIASQEYNIPIArSLPPNYLSFQDSASGKHGGSDIKWVDGGKPLFK 

itmitmrirtitm^ » it * » ••4r* it • • 
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EVQAVSSVHTQDNHLEKFFTLCHSLESQVTFPIRVLDQKI 



FKSHLESTIYTQDLHVHKFFHHCQLIQS 

i: 

VEWAVSSIHTQDPYLDKFFALVNALDEtH 
ISTHLVSTVYTQDQHLHNFFQYCQKTES- 
VELTAVS S VHPQDP YLDKFFTLVHVLEE G 
VSTFWSTVNTQDPHVNAFFQECQKREKD- 



GS KE VPGEL I KYL KCLHAM 
S EMALEHELKLS 1 1 CLNS S 
LFPVRIGDMRIMENNLENELKSSISALNSS 

GAQALGNELVKYLKSLHAM 

AFPFRLKDTVLSEGNVEQELRASLAALRLA 
MSQSPTSNFIRSCKNLLNVE 



INCTMV-LLHIVSKCHEEGLDS YLRSFIKYS FRPEKP 

RNCLLASYVHYVFRLPEVQRDVPKSGAPTALLDPRSYHTYGRTSAAAVSSKLLQARVMSS 
RNSLLASYIHYVFRLPNTYPNSSSPG-PGGLGGSVHYATMARSAVRPASLNLNRSRSLSN 

VNVTRV- IIHWAQCHEEGLES HLRSYVKYA YKAEPY 

HCPQLAAYVHYAFRLPGTEPSLPDGAPP-* VTVQAATLARGSGRPASLYLARSKSISS 

TTVTRV-LPDIVAKCHEEQLDH SVQSYIKFV FKTRAC 



794 
715 
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787 
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E I Q VM I QFLP VI LMQLF R VLTNMTH EDD VP 824 

RLEPLVLFLHLVLDKLFQLSVQPMVIAGQTANFSQFAFESWAIANSLHNSKDLSJCDQHG 775 

QLEPWRFLHLLLDKLILLVIRPPVIAGQIVNLGQASFEAMASIINRLHKNLEGNHDQHG 84 0 

EGHVMIAFLPTILNQLFR VLT-RAT QEEVA 816 

SPEPLVAFSHHVLDKLVRLVIRPPIISGQIVNLGRGAFEAMAHWSLVHRSLEAAQDARG 82 7 

KI HAIMS FLP 1 1 LNQLF K VLVQNE EDE IT 916 

..*.*.* 



860 
835 
899 
852 
883 
952 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



SAPQAQLIH ETLATTMI AILKQS 883 

SNPDLAGTHSAADEEVKNIMSSKIADRNCSRMSYYCSGSSDAPSSPA 882 

SNPDISGTPTSPDDEVRSIIGSKGLDRSNSWVNTGGPKAAPWGSNPSPSAESTQAMDRSC 959 

VASE YKTVH EELTKSMTT I LKPS 875 

SNPDLAVAPGS VDDE VSRI LAS KLLHEELA - LQ 915 

KE RPVH EDLAKNVTGLLKSN 972 
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ADFLSINKLLKYS WFFFEIIAKSM 

APR PAS KKH FHEELALQ M WS TGMVKS M 

NRMSSHTETSSFLQTLTGRLPTKKLFHEELALQWWCSGSVRESALQQAWFFFELMVKSM 

ADFLTSNKLLRYS WFFFDVLIKSM 

WWSSSAVREAILQHA WFFFQLMVKSM 

DSPTVKHVLKHS WFFFAIILKSM 
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Cadhe rin C leavage 
ATYLLEENKI 
AQHVHNMDKRDS 



KLPpGQFFPETYHHVLHSLLLAIIPHVTIRYAEIPDE SRNVNYSLAS 

iRTRFSDRFMDDITTIVNWTSEIAALLVKPQKENEQAEKMNISLAF 

VHHLYFNDKLEAPjRKSF FPERFMDDIAALVSTIASDI VSRFQKDTEM VERLNI SLAF 

AQHLIENSKVKLLRNQR FPASYHHAAETWNMLMPHITQKFGDNPEA SKNANt SLAV 

ALHLLLGQRLDTPRKLRFPGRFLDDITALVGSVGLEVITRVHKDVEL AEHLNASLAF 

AQHLIDTNKIQLPRPQF FPESYQNELDNLVMVLSDHVIWKYKDALEE TRRATt SVAR 
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FLKRCLTLMDRGFIFNLINDYISGFSPKDP K VLAEYKFEFLQTICNHEHYIPLNL 

FLYDLLSLMDRGFVFNLIRHYCSQLSAKLSNL PTLISMRLEFLRILCSHEHYLNLNL 

FLNDLLSVMDRGFVFSLIKSCYKQVSSKLYSLPNPSVLVSLRLDFLRIICSHEHYVTLNL 

FIKRCFTFMDRGFVFKQINNYISCFAPGDP KTLFEYKFEFLRWCNHEHYIPLNL 

FLSDLLSLVDRGFVFjsLVRAHYKQVATRLQSSPNPAALLTLRMEFTRILCSHEHYVTLNL 

K TLCQ YKFDFLQEVCQHEHFI PLCL 
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Cadherin EC 
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PMAFAKPKLQR VQDS - - NLE YSLS DE YCKHHFLVGL LLRE IS 

FFMNADTAPTSP - - CPS I SSQNSSSCSSFQDQKI ASMFDLTSEYRQQHFLTGL LFTE LAA 
PCSLLTPPASPSPSVSSATSQSSGFSTNVQDQKIANMFELSVPFRQQHYLAGLVLTELiAV 

PMPFGKGRIQR YQDL- -QLDYSLTDEFCRNHFLVGLLLREVGT 

PCCPLSPPASPSPSVSSTTSQSSTFSSQAPDPKVTSMFELSGPFRQQHFLAGLLLTELjAL 
PIRSANI PDPLTP SES TQELHASDMPEYS VTNEFCRKHFLIGI LLRE VGF 



motif 
I 1060 



ALQDN 

ALDAEGEGI 



ye;irytaii 



sviknllikhafdtryqhknqqakiaqlylpfvgllleni3rl 
skIvqrkavsaihsllsshdldprcvkpevkvkiaalylplvgiildalp-- 
ildpdaeglfglhkkvinmvhnllsshdsdprysdpqikarvamlylpligiimetvp-- 

alqefr evrliaisvlknllikhsfddryasrshqariatlylplfgllienv2ri 

alepeaegafllhkkai savhsllcghdtdpryaeatvkarvaelylplls i ardtl p - - 
alqedq dvrhlalavlknlmakhsfddryreprkqaqiaslymplygmlldnmpri 



AGRDTLYSCA AMPN-S ASRDEFPCGFTSPANRGSLSTDKDTAYGS 

QL CDFTVADTRRYRTSGSD 

QLY DFTETHNQRGRPICIATDD- - 

NVRDVSPFPVNAGMTVKDESLALPA-VNPLVTPQKGSTLDNSLHKDLLGAISGIASPYTT 

RLH DFAEGPGQRSRLASMLDSDTE 

YLKDLYPFTVNTSNQGSRDDLSTNGGFQSQTAIKHANSVDTSFSKDVLNSIAAFSSIAIS 
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1157 
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FQ - NGHGI KREDSRGS L I PEGATGFPDQGNTGEN TRQS STRS S VSQYNRLDQ YE 1213 

EEQEGAGAINQNVALAIAGNNFNLKT SGI VLSSLPYKQYNMLNADT 12 08 

YESESGSMISQTVAMAIAGTSVPQLTR PGSFLLTSTSGRQHTTFSAES 13 24 

STPNINSVRNADSRGSLISTDSGNSLPERNSEKSNSLDKHQQSSTLGNSWRCDKLDQSE 1227 

GEGDIAGTINPSVAMAIAGGPLAPGSR ASISQGPPTASRAGCALSAES 124 9 

TVNHADSRASLASLDSNPSTNEKSSEKTDNCEKIPRPLALIGSTLRFDRLDQAE 132 7 
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IRSLLMCYLYIVKMISEDTLLTYWNKVSPQELINILILLEVCLFHFRYMGKRNIARraOA 
IRNLMICFLWIMKNADQSLIRKWIADLPSTQLNRILDLLFICVLCFEYKGKQSSDKV'STQ 
SRSLLICLLWVLKNADETVLQKWFTDLSVLQLNRLLDLLYLCVSCFEYKGKKVFER^SL 
IKSLLMCFLYILKSMSDDALFTYWNKASTSELMDFFTISEVCLHQFQYMGKRYIAR^JQEG 
£ RTLLACVLWVLKNTEPALLQRWATDLTLPQLGRLLDLLYLCLAAFEYKGKKAFER ENSL 
1 RSLLMCFLHIMKTISYETLIAYWQRAPSPEVSDFFSILDVCLQNFRYLGKRNIIR <:iAA 
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WLSKHFGIDR-- 
VLQKSRDVKAR 
TFKKSKDMRAK 
LGPIVHDRKS-- 



-KSQTMPALRNRSGVMQARLQHLSSLESS 

LEEALLRGEGARGEMMRRRAPGNDRFPGLNEN 

LEEAILGSIGARQEMVRRSRGQLERSPSGSAFGSQ 
QTLPVSRNRTGMMHARLQQLGSLDNS 



TFKKSLDMKAR LEEAILGTIGARQEMVRRSRERSPFGNPEN 

AFKFVQSTQNNGTLKGSNPSCQTSGLLAQWMHSTSRHEGHKQHRSQTLPIIRGKN 
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FTLNHS S TTTE A D I FHQALLEGNTATE VS LTVLDT I S 
LRWKKEQTHWRQANEKLDKTKAIELDQEALISGNLATEAHLI 

: I EHEALI DGNLATEANL I 



ENLRWRKDMTHWRQNTEKLDKSRAE 



FFfTQCFKTQLL 
-ALD 
-VTE 



ILDMQENIIQASS 



ILDTLEIVVQTVS 



LTFNHSYGHSDADVLHQSLLEANIATEVCLTALDTLSLFTLAFKNQLL 

- - VRWRKSVTHWKQTSDRVDKTKDEMEHEALVEGNLATEASLWLDTLEI I VQTVM- LSE 
- -ALSNPKLLQMLDNTMTSNSNEIDIVHHVDTEANIATEGCLTILDLVSLFrQTHQRQLQ 
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NNDGHNPLMKKVFDIHLAFLKNGQSEVSLKHVFASLRAFISKFPSAFFKGRVNMCAAFCY 1419 

CKDS LLGGVLRVLVNSLNCDQSTTYLTHCFATLRALI AKFGDLLFEEEVEQCFDLCH 14 2 5 

SKES ILGGVLKVLLHSMACNQSAVYLQHCFATQRALVSKFPELIiFEEETEQCADLCL 154 6 

ADHGHNPLMKKVFDVYLCFLQKHQSETALKNVFTALRSLI YKFPSTFYEGRADMCAALCY 14 31 

ARES VLGAVLKWLYSLGSAQSALFLQHGLATQRALVSKFPELLFEEDTELCADLCL 14 64 

QCDCQNSLMKRGFDTYMLFFQVNQSATALKHVFASLRLFVCKFPSAFFQGPADLCGSFCY 156 0 
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EVLKCCTSKISSTRNEASALLYLLMRNNFEYTKRKTFLRTHLQIIIAVSQLIADVALSGG 14 7 9 
QVLHHCSSSMDVTRSQACATIiYLLMR- - FSFGATSNFARVKMQVTMSLASLVGRAPDFNE 14 83 
RLLRHCSSSIGTIRSHPSASLYLLMR--QNFEIGNNFARVKMQVPMSLSSLVGTSQNFNE 16 04 
EILKCCNSKLSSIRTEASQLLYFLMRNNFDYTGKKSFVRTHLQVIISVSQLIADWGIGE 14 91 
RLLRHCGSRISTIRTHASASLYLLMR--QNFEIGHNFARVKMQVTMSLSSLVGTTQNFSE 1522 
EVLKCCNHRSRSTQTEASALLYLFMRKNFEFNKQKSIVRSHLQLIKAVSQLIADAG-IGG 1619 
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hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



SRFQESLFIINNFANSDRPMKATAFPAEVKDLTKRIRTVLMATAQMKEHEKDPEMLIDLQ 
EHLRRSLRTILAYSEEDTAMQMTPFPTQVEELLCNLNSILYDTVKMREFQEDPEMLMDLM 
EFLRRSLKTILTYAEEDLELRETTFPDQVQDLVFNLHMILSDTVKMKEHQEDPEMLIDLM 
TRFQQSLSIINNCANSDRLIKHTSFSSDVKDLTKRIRTVLMATAQMKEHENDPEMLVDLQ 
EHLRRSLKTILTYAEEDMGLRDSTFAEQVQDLMFNLHMILTDTVKMKEHQEDPEMLIDLM 
SRFQHSLAITNNFANGDKQMKNSNFPAEVKDLTKRIRTVLMATAQMKEHEKDPEMLVDLQ 
::.** :: * :: : *. :*::* .:. :* *'.*_;. *****.** 

transmembrane 



YSLAKSYASTPELRKTWLDSMAKIHVKNGDFSEAAMCYVHVAALVAEFLHRKK 
YRIAKSYQASPDLRLTWLQNMAEKHTKKKCYTEAAMCLVHAAALVAEYLSMLEDH 
YRI AKGYQTS PE - RLTWLQNMAGKHSERSNHAEAAQCLVHS AALVAE YL SMLEDR 
YSLAKSYASTPELRKTWLDSMARIHVKNGDLSEAAMCYVHVTALVAEYLTRKG 
YRI ARGYQGS PDLRLTWLQNMAGKHAELGN HAEAAQCMVHAAALVAE YL ALLEDQ 
YSLANSYASTPELRRTWLESMAKIHARNGDLSEAAMCYIHIAALIAEYL 
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KRKGYWKVEKI 



LFPNGCSAFKKITPNIDEEGAMKEDAGMMD 

SYLPVGSVSFQNISSNVLEESWSEDTLSPDEDGV 

KYLPVGCVTFQNISSNVLEESAVSDDWSPDEEGI 

VFRQGCTAFRV I T PN I DEE AS MMED VGMQD 

RHLPVGCVSFQNISSNVLEESAISDDILSPDEEGF 

CTASLLSEDTHPCDSNSLLTTPSGGSMFSMGWPAFLSITPNIKEEGAAKEDSGMHD 
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IT AM 



VHYSEEVLLELLEQCVDGLWKAERYEIISEISKLIVPIYEKRREFEKLTQVYRTLHG 

CAGQYFTESGLVGLLEQAAELFSTGGLYETVNEVYKLVIPILEAHREFRKLTLTHSKLQR 
CSGKYFTESGLVGLLEQAAASFSMAGMYEAVNEVYKVLIPIHEANRDAKKLSTIHGKLQE 

VHFNEDVLMELLEQCADGLWKAERYELIADIYKLIIPIYEKRR 

CSGKHFTELGLVGLLEQAAGYFTMGGLYEAVNEVYKNLIPILEAHRDYKKLAAVHGKLQE 
TPYNENILVEQLYMCGEFLWKSERYELIADVNKPIIAVFEKQRDFKKLSDIYYDIHR 
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IT AM 



DOCK motif 



DOCK motif 



A YTKI LEVMHTKKRLLGT FFRV AF YGQS FFEEED 
AFDSIVNKDH--KRMFGTYFRyGFFG-SKFGDLDEQEF|V[YKEP[AITKLPEI 
AFSKIVHQSTGWERMFGTYFRVGFYG-TKFGDLDEQEFMYKEPIAITKLAEI 
DFFEDED 

aftkimhqssgwervfgtyfrvgfyg-ahfgdldeqefmykepisitklaei 
sylkvaewnsekrlfgryyrvafygqgffeeee|gkey|i|ykep|kltglsei 

* 



;keyi 



YKEPKLTGLSEI 



* * * * 



IT AM 

SLRLVKiJyG 173 9 

SHRLEAFra 17 50 

SHRLEGFYG 18 72 

SQRLLKLYS 1710 

SHRLEEFYT 1791 

SQRLLKLYA 1912 
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hCLASP4 
hCLASPS 
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hCLASP2 
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hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



eiSfgtewki iqdsdkvnakeldpk5^hiqvI^^ 



ERFGEDWEVi: 



IT AM 
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QCPGAEPVEVIKDSTPVDKTKLDPNKAYIQITFVEPYFDEYEMKDRVTYFEKNFNLRRFM 

iKDSNPVDKCKLDPNpCAYIQITYVEPYFDTYEMKDRITYFDKNYNLRRFM 

)VTHVIPFFDEKELQERKTEFERSHNIRRFM 



DKFGSENVKMIQDSGKVNPKDLDSKYAYIQ 
ERFGDDWEIIKDSYPVDKSKLDSQKAYIQI|T|YVEP|YFDTYELKDRVTYFDRNYGLRTFL 
DKFGADNVKIIQDSNKVNPKDLDPKYAYIQVTYVTPFFEEKEIEDRKTDFEMHHNINRFV 



* . 
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IT AM 



DOCK motif 



FEAPYTLSGKKQGCIEEQCKRRTILTTSNSFPYVKKRIPINCEQQINLKPIDGATDEIKD 
YTTPFTLEGRPRGELHEQYRRNTVLTTMHAFJ^YIKl^RISVIQKEEFVLTPIEVAIEDMKK 
YCTPFTLDGRAHGELHEQFKRKTILTTSHAFPYIKTRVNVTHKEEIILTPIEVAIEDMQK 
FEMPFTQTGKRQGGVEEQCKRRTILTAIHCFPYVKKRIPVMYQHHTDLNPIEVAIDEMSK 
FCTPFTPDGRAHGELPEQHKRKTLLSTDHAFPYIK'tRIRVCHREETVLTPVEVAIEDMQK 
FETPFTLSGKKHGGVAEQCKRRTILTTSHLFPYVKKRIQVISQSSTELNPIEVAIDEMSR 



* . * 



•k -k 



Coiled-coil 



. * 
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KTAELQKLCSSTDVDMIQLQLKLQGWVSVQVNAGPLAYARAFLNDSQASKYPPKKVSELK 
KTLQLAVAINQEPPDAKMLQMVLQGSVGATVNQGPLEVAQVFLAEIPADPKLYRHHNKLR 
KTQELAFATHQDPADPKMLQMVLQGSVGTTVNQGPLEVAQVFLSEIPSDPKLFRHHNKLR 
KVAELRQLCSSAEVDMIKLQLKLQGSVSVQVNAGPLAYARAFLDDTNTKRYPDNKVKLLK 
KTRELAFATEQDPPDAKMLQMVLQGSVGPTVNQGPLEVAQVFLAEIPEDPKLFRHHNKLR 
KVSELNQLCTMEEVDMISLQLKLQGSVSVKVNAGPMAYARAFLEETNAKKYPDNQVKLLK 



* * * * 
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Coiled-coil 



•k . 



DMFRKFIQACSIALELNERLIKEDQVEYHEGLKSNFRDMVKELSDIIHEQILQEDTMHSP 
LCFKEFIMRCGEAVEKNKRLITADQREYQQELKKNYNKLKENLRPMERKIPELYKPIFR 

LCPKDFTKRCED ALRKNKS L I GPVQKE YQRELGKLS SP 

EVFRQFVEACGQALAVNERLIKEDQLEYQEEMKANYREMAKELSEIMHEQICPLEEKTS- 
LCFKDFCKKCEDALRICNKALIGPDQKEYHRELERNYCRLREALQPLLTQRLPQLMAPTP- 
EIFRQFADACGQALDVNERLIKEDQLEYQEELRSHYKDMLSELSTVMNEQITGRDDLSKR 



•k . 
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PDZ ligand 



WMSNTLHVFCAISGTSSDRGYGSPR |YAEV| - - 2 008 
VESQKRDSFHRSSFRKCETQLSQGS 2015 



VLPNSLHIFNAISGTPTSTMVHGMTSSSSW 1980 



- - PGLRNSLNRAS FRKADL 2 047 

GVDQTCTRVI S KATPALPT VS I S ^SAEV] 2180 



1799 
1810 
1932 
1770 
1851 
1972 



1859 
1870 
1992 
1830 
1911 
2032 



1919 
1930 
2052 
1890 
1971 
2092 



1979 
1990 
2090 
1949 
2030 
2152 
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